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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S . pristi naespi ral i s 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GGATCCTGGC 


GTCCGCCGTC 


AAGAACTGAA 


CCGAGGAGAC 


ACCCACCATG 


ACCGCACCCC 


60 


GCCGGCGCAT 


CACCCTCGCC 


GGCATCATCG 


ACGGCCCCGG 


CGGCCATGTG 


GCCGCCTGGC 


120 


GCCACCCGGC 


GACCAAGGCG 


GACGCCCAGC 


TCGACTTCGA 


ATTCCACCGC 


GACAACGCCC 


180 


GCACCCTCGA 


ACGCGGCCTG 


TTCGACGCC6 


TGTTCATCGC 


GGACATCGTC 


GCCGTGTGGG 


240 


GCACCCGCCT 


GGACTCCCTG 


TGCCGCACCT 


CGCGCACCGA 


GGVCTTCGAA 


CCGCTCACCC 


300 


TGCTCGCCGC 


CTACGCCGCG 


GTCACCGAGC 


ACATCGGCCT 


GTGCGCCACC 


GCCACCACCA 


360 


CGTACAACGA 


ACCGGCGCAC 


ATCGCCGCCC 


GCTTCGCCTC 


CCTCGACCAC 


CTCAGCGGCG 


420 


GCCGGGCCGG 


CTGGAACGTC 


GTCACCTCCG 


CCGCACC6TG 


GGAGTCCGCC 


AACTTCGGCT 


480 


TCCCCGAGCA 


CCTGGAGCAC 


GGCAAACGCT 


ACGAGCGGGC 


CGAGGAGTTC 


ATCGACGTCG 


540 


TCAAAAAACT 


GTGGGACAGC 


GACGGCCGCC 


CCGTCGACCA 


CCGCGGCACC 


cacttcgagg" 


600 


CCCCCGGCCC 


GCTCGGGATC 


GCCCGCCCCC 


CGCAGGGCCG 


CCCCGTCATC 


ATCCAGGCCG 


660 


GCTCCTCGCC 


GGTGGGACGC 


GAGTTCGCCG 


CCCGGCACGC 


CGAGGTCATC 


ttcacccggc 


720 


ACAACCGGCT 


CTCCGACGCC 


CAGGACTTCT 


ACGGCGACCT 


CAAGGCACGC 


gtcgcccggc 


780 


ACGGCCGCGA 


CCCCGAGAAG 


GTCCTCGTGT 


GGCCGACCCT 


CGCGCCGATC 


gtcgccgcca 


840 


CCGACACCGA 


GGCGAAGCAG 


CGCCTGCAGG 


AACTGCAGGA 


CCTCACCCAC 


gaccatgtcg 


900 


CCCTGCGCAC 


CCTTCAGGAC 


CACCTC6GCG 


ACGTCGACCT 


GAGCGCGTAC 


ccgatcgacg 


960 


GGCCCGTCCC 


CGACATCCCG 


TACACCAACC 


AGTCCCAGTC 


GACGACCGAG 


CGGCTGATCG 


1020 


GCCTGGCCAG 


GCGCGAGAAC 


CTCAGCATCC 


GCGAGCTGGC 


CCTGCGGCTG 


AT66GCGACA 


1080 


TCGTCGTCGG 


CACACCGGAG 


CAGCTCGCCG 


ACCACATGGA 


GAGCTGGTTC 


ACCGGCCGCG 


1140 


GCGCCGACGG 


CTTCAACATC 


GACTTCCCGT 


ACCTGCCGGG 


CTCCGCCGAC 


GACTTCGTCG 


1200 


ACCACGTGGT 


GCCCGAACTG 


CAGCGCCGCG 


GCCTGTACCG 


CTCGGGCTAC 


GAGGGCACCA 


1260 


CCCTGCGGGC 


CAACCTCGGC 


ATCGACGCCC 


CCCGGAAGGC 


AGGTGCAGCG 


GCTTGACTTC 


1320 



Page 2 



03806.0054-04. seql St 



CGTCCTAAAG 


GCGGGGGATT 




1 UV3V3V3 


\ \ KX. I V3V. 1 1 V. 


AV.V.V3AV.V3AV.V. 


X30U 


GCCCCGTCCG 


GGAGGACTCC 




TTAXA ccfirrc 

1 1 M 1 Al.^\3 1 v. 


\ v.w\v>\VaV3V.v. 


V3Av.V3V.V.V3V.V^ 


XH*fVi 


GCCCGGCGGC 


CAGGATGTTG 






V3 1 V-A 1 V3\»AV^ 


V3L.V3LV.V3V.AV3 1 


1 Rfin 


CGCACGTCCA 


CTCCCGGACG 




fit ' 1 TCCCCrT' 


V3AV.V.V3 1 UV.^V3 


LAV3V3 1 1 V.LV3V. 


1 ^<;n 


ACAGCTT6GA GCTGGGGAAC 


/" A rT'CC^CC A 


\ v>\L.uAV.V3Avs 


1 1 V-V3UV3LV.CA 


1 AL.LAV3V3Lv3L 


1 C5n 
lozu 


ACTTGTACTC 


CAGCATGGAG 






Va 1 v-VaVaAlaA 1 Va 


VaLVaLCaCVaLvaA 


1 ^^QH 
IDOU 


GCTTGCCGTT 


CTTCAGCAGG 




TC A rcvrcrr 
\ vjAGVa 1 1 1» 


KTr" Krr f^rT" 
VaA 1 v>\V.V3AV.\- 


t — 1 — I — Tr't — I — TC"T 

Va 1 1 1 vaVa 1 1 L 1 


X/4U 


CACGGACGAG 


TCGAGTCGAC 


AUl. 1 1 0 1 uvjA 


/~/^ A A r-rfff A 


UV.V3V.V.V1U 1 \Ja 


V3 1 VaA 1 CCVavaG 


XoUU 


CGTGGACGCG 


GGCGACCTTG 




Tr'TTTTTT'TT 


V.V3V.V.V3AV.V.V-V. 


1 1 V.V3L,v.TT V3L. 


J.OOU 


GCGACACGTC 


CCGCTGAGCC 


\ \ L.vjk.V3Auviv. 


l3V3V3L.l3V.V3V3 1 V. 


AV.V3V3V.V3L 1 V.V3 


iVaL. TTV3GGGT 




TGGTGATCTT 


CTCCCCGGTG 


/■ A ^ A /"/"/"T/" A 

laACJXCa^laTCA 


V-CAGGGAGGT 


GATCCCGGCG 


TCGATGLCGA 




CGGCCGCCGT 


GGTGGCGGGC 


oCouiauil 1 13A 


I V3V3 I V3 T Q.V-TV- 


V3LA\-AuLAv3U 


GACACGAALC 




AGCGGCCCGC 


ACGGTCGCGG 


\3AUAV.OU 1 \J\ 


V.V.V3 1 V.O 1 I.V3V3 


V-Tv-LVaCLCC T 


TCGGGAAVsGG 




GACGGGACCA 


GCGGATGTCC 


A fffff^CTT' 
Al3UO\3\. 1 \Jm 


k.Vjl3 1 V. 1 1 V.V3V. 


v^XvaLviTVaAvav. 


TvaTLCG TTAL. 


^1 Cits 


GCCACGTGAA 


GGCGCTGCGG 


VJ 1 0 1 AV- 1 




VaVaAL 1 1 M 1 L 


fffT' A / — 1 — r/ — r 
LVaLVaAL ITVaT 


LLLX) 


ACCGCGGGTA 


CTTCGACCGC 


"TTrrcc A A A 
1 1 oov^oAAoA 


A/~"n"/~^/"^ A A 

AV3 1 1 VaVaV-VaAA 


LVaLLVa 1 L 1 VaL 


AAvaTvaLLVaLA 


*5'5Qn 


GCGCCTGCTG 


GAGCGGGACG 


(jACaCaAV-Av-LT 


CCvaACiAGGAA 


GGCGAGTTCT 


TCGG 1 L 1 1 L 1 


"5 3 /I A 


TCCACTCCGT 


CAGCGCGGCG 


V3AL.V1AV. 1 \j\J\ 


V.U 1 AvaVsAVaAv. 


LLvavaLVaLTVaL 


TlGCCGTACC 




AGGCTCGCGT 


GCGCCCCTCA 


A C ffffTTfT 

MaxMKX. 1 1 V3 1 


1 V3 1 AV^V.V3AVa 


VsLvaVaACALAva 


LLvaAACGTGC 


Z4oU 


GGGACAGCTC 


AGCCGCCTGC 


\ V.V3 1 \J>M \ V3V3 


V3A 1 AAAAV3L.V3 


/~T A 1 1 /* AAA 

Va 1 AL 11 vaAAA 


GLCCGL rivaA 


■) c*5n 


CCTGCTGCAT 


CACGCCTCAC 


A\.i3L. 1 A 1 


\ \ V»UV.V3 1 V3 1 V3 


AV3V.\3V3LV3V3V3T 


V3TC T GCCGG T 




GGTTGCAGAC 


GCCGAACCGC 


ffTfrT'fr'r'r' 


ATTQ,V3V-L.W\T 


V-V-CT GCV.CTG 


CTCCGCAAGA 




GCTTCGTCTC 


CTCCCCGGTC 


TvaAAvauCCuCa 


ff^TKTr'^ A 

uGTATCtACG 


A A A A TT^T 

AAGGAATT CT 


^ AT"/" A /■/■/■/"/■ 

GATGACCGCG 


^/UU 


CCCATCCTCG 


TCGCCACCCT 


CGACACCCGC 


GGCCCCGCCG 


CCACCC T CGG 


CACGATCACC 




CGCGCCGTGC 


GGGCCGCGGA 


GGCCGCCGGA 


TTCGACGCCG 


TCCTGATCGA 


CGACCGGGCC 




GCCGCCGGCG 


TCCAGGGCCG 


Ca IT CG AG ACG 


ACGACGC T GA 


CLGCCGCGCT 


GGCCGCCGTL 


OQQA 

ZooU 


ACCGAGCACA 


TCGGCCTGAT 


CACCGCCCCG 


CTCCCGGCCG 


ACCAGGCCCC 


CTACCACGTG 


2940 


TCCCGGATCA 


CCGCCTCGCT 


CGACCACCTC 


GCCCACGGCC 


GCACCGGCTG 


GCTCGCGAGC 


3000 


ACGGACACCA 


CCGACCCCGA 


GGGCCGCACC 


GGCGAACTCA 


TCGACGTCGT 


CCGCGGCCTG 


3060 


TGGGACAGCT 


TCGACGACGA 


CGCCTTCGTC 


CACGACCGCG 


CCGACGGCCT 


GTACTGGCGG 


3120 


CTGCCCGCCG 


TCCACCAACr 


CGACCACCAG 


GGCAGGCACT 
Page 


TCGACGTGGC 
3 


CGGCCCCCTC 


3180 
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AACGTCGCCC 


GCCCGCCGCA 


GGGCCACCCC 


GTCGTCGCCG 


TCACCGGCCC 


CGCCCTCGCC 


3240 


GCGGCCGCCG 


ACCTCGTCCT 


GCTCGACGAG 


GCGGCCGACG 


CCGCCTCGGT 


GAAGCAGCAG 


3300 


GCACCGCACG 


CCAAGATCCT 


CCTGCCGCTG 


CCCGGCCCGG 


CCGCCGAACT 


GCCCGCCGAC 


3360 


AGCCCCGCGG 


ACGGCTTCAC 


GGTGGCGCTC 


ACCGGCTCCG 


ACGACCCGGT 


CCTGGCCGCG 


3420 


CTCGCC6CCC 


GGCCCGGCCG 


CCCGGACCGC 


ACCGCGGCCA 


CCACCCTGCG 


CGAACGCCTG 


3480 


GGCCTGGCCC 


GCCCCGA6AG 


CCGCCACGCC 


CTCACCACCG 


CCTGACGACC 


CGTCC6CCCG 


3540 


CTGCTTCCTG 


GAGAGTCATG 


TCCCGTCGCC 


TGTTCACCTC 


GGAGTCCGTG 


ACCGAGGGCC 


3600 


ACCCCGACAA 


GATCGCCGAC 


CAGATCAGTG 


ACACCGTCCT 


CGACGCCCTG 


CTGCGCGAGG 


3660 


ACCCCGCCTC 


ACGCGTCGCG 


GTCGAGACCC 


TGATCACCAC 


CGGCCAGGTC 


CACATCGCCG 


3720 


GCGAGGTCAC 


CACCAAGGCG 


TACGCGCCCA 


TCGCCCAACT 


GGTCCGCGAC 


ACGATCCTGG 


3780 


CCATCGGCTA 


CGACTCGTCC 


GCCAAGGGCT 


TCGACGGCGC 


CTCCTGCGGC 


GTCTCCGTCT 


3840 


CCATCGGCGC 


GCAGTCCCCG 


GACATCGCCC 


AGGGCGTCGA 


CAGCGCCTAC 


GAGACCCGCG 


3900 


TCGAGGGCGA 


GGACGACGAG 


CTCGACCAGC 


AGGGCGCCGG 


CGACCAGGGC 


CTGATGTTCG 


3960 


GCTACGCCAC 


CGACGAGACC 


CCCTCGCTGA 


TGCCGCTGCC 


CATCGAGCTC 


GCCCACCGCC 


4020 


TCTCGCGCCG 


GCTCACCGAG 


GTCCGCAAGG 


ACGGCACCGT 


CCCCTACCTG 


CGCCCCGACG 


4080 


GCAAGACCCA 


GGTCACCATC 


GAGTACCAGG 


GCAGCCGCCC 


GGTGCGCCTG 


GACACCGTCG 


4140 


TCGTCTCCTC 


CCAGCACGCC 


GCCGACATCG 


ACCTCGGCTC 


CCTGCTCACC 


CCCGACATCC 


4200 


GCGAGCACGT 


CGTCGAGCAC 


GTCCTCGCCG 


CACTCGCCGA 


GGACGGCATC 


AAGCTCGAGA 


4260 


CGGACAACTA 


CCGCCTGCTG 


GTCAACCCGA 


CCGGCCGTTT 


CGAGATCGGC 


GGCCCGATGG 


4320 


GCGACGCCGG 


CCTGACCGGC 


CGCAAGATCA 


TCATCGACAC 


GTACGGCGGC 


ATGGCCCGCC 


4380 


ACGGCGGTGG 


CGCGTTCTCC 


GGCAAGGACC 


CGTCCAAGGT 


CGACCGTTCC 


GCCGCGTACG 


4440 


CGATGCGCTG 


GGTCGCCAAG 


AACGTCGTCG 


CCGCGGGCCT 


CGCCTCCCGC 


TGCGAGGTCC 


4500 


AGGTCGCCTA 


CGCCATCGGC 


AAGGCCGAGC 


CGGTCGGCCT 


GTTCGTCGAG 


ACGTTCGGCA 


4560 


CCGGCACCGT 


CGCCCAGGAG 


CGCATCGAGA 


AGGCCATCAC 


CGAGGTCTTC 


GACCTGCGCC 


4620 


CCGCGGCCAT 


CATCCGCGAC 


CTCGACCTGC 


TGCGGCCCAT 


CTACGCCGCC 


ACCGCCGCCT 


4680 


ACGGCCACTT 


CGGCCGCGAA 


CTGCCCGACT 


TCACCTGGGA 


GCGGACCGAC 


CGCGCCCACC 


4740 


GGCTCAAGGC 


CGCGGCCGGT 


CTCTGAGCCG 


GCCGGACCTG 


TGAGGAGACC 


TGACGTGCGC 


4800 


ATCGCTGTCA 


CCGGTTCCAT 


CGCCACCGAC 


CATCTGATGG 


TCTTCCCCGG 


CCGGTTCGCG 


4860 


GATCAGCTGA 


TCCCCGACCA 


GCTCGCTCAT 


GTCTCGCTCT 


CCTTCCTGGT 


CGACGCACTC 


4920 


GAGGTGCGCC 


GGGGCGGAGT 


GGCGGACAAC 


GTCGCCTTCG 


GCCTCGGCGG 


CCTCGGCCTC 


4980 


ACCCCCCAGC 


TGGTCGGCGC 


CGTG6GCAGC 


GACTTCGCCG 


AGTACGAGGT 


CTGGCTCAAG 


5040 


GAACACGGCG 


TCGACACCGG 


CCCCGTCCTG 


GTCTCCACCG 

Page 


AGCGGCAGAC 
4 


CGCCCGGTTC 


5100 
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ATGTGCATCA CCGACCAGGA CTCCAACCAG ATCGCCTCCT TCTACGCGGG CGCCATGCAA 5160 

GAGGCCCGCG AGVTCGACCT GTGGCACCTG ACCACCGGCA GCGTCCGCCC CGACCTCGTC 5220 

CTGGTCTGCC CGAACGACCC GGCGGCGATG CTGCGCCACA CGGGGAGTGC CGCGAAACTG 5280 

GGCCTGCCGT TCGCCGCCGA CCCCTCCCAG CAGCTCGCCC GCCTGGAGGG AGGGAGGTAC 5340 

GCGAACTCGG TCGACGGGGC CCGTTGGTTT TTCACCAACG AAGTACGAGG CC 5392 

(2) INFORMATION FOR SEQ ID NO: 2: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) topology: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: S . pri sti naespi ral i s 

Cix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1268 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATG ACC GCA CCC CGC CGG CGC ATC ACC CTC GCC GGC ATC ATC GAC GGC 48 
Met Thr Ala Pro Arg Arg Arg lie Thr Leu Ala Gly lie lie Asp Gly 
15 10 15 

CCC GGC GGC CAT GTG GCC GCC TGG CGC CAC CCG GCG ACC AAG GCG GAC 96 
Pro Gly Gly His val Ala Ala Trp Arg His Pro Ala Thr Lys Ala Asp 
20 25 30 

GCC CAG CTC GAC TTC GAA TTC CAC CGC GAC AAC GCC CGC ACC CTC GAA 144 

Ala Gin Leu Asp Phe Glu Phe His Arg Asp Asn Ala Arg Thr Leu Glu 
35 40 45 

CGC GGC CTG TTC GAC GCC GTG TTC ATC GCG GAC ATC GTC GCC GTG TGG 192 
Arg Gly Leu Phe Asp Ala val Phe lie Ala Asp lie val Ala val Trp . 
50 55 60 

GGC ACC CGC CTG GAC TCC CTG TGC CGC ACC TCG CGC ACC GAG CAC TTC 240 

Gly Thr Arg Leu Asp Ser Leu Cys Arg Thr Ser Arg Thr Glu His Phe 
65 70 75 80 

GAA CCG CTC ACC CTG CTC GCC GCC TAC GCC GCG GTC ACC GAG CAC ATC 288 

Glu Pro Leu Thr Leu Leu Ala Ala Tyr Ala Ala val Thr Glu His lie 
85 90 95 

GGC CTG TGC GCC ACC GCC ACC ACC ACG TAC AAC GAA CCG GCG CAC ATC 336 

Gly Leu Cys Ala Thr Ala Thr Thr Thr Tyr Asn Glu Pro Ala His lie 
100 105 110 

Page 5 



Ala 


GCC 

Ala 


CoC 

Arg 
115 


11/- 

1 rc 

Phe 


CCC 

GCC 

Ala 


TCC 

Ser 


CTC 

C 1 c 

Leu 


C AC 

Asp 
120 
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CAC CTC AGC GGC GGC CGG 
His Leu ser Gly Gly Arg 
125 


CCC 

GCC 

Ala 


CCC 

GGC 

Gly 




Trp 


Asn 
130 


val 


val 


Thr 


TCC 

Ser 


CCC 

Ala 
135 


CCL. 

Ala 


CCG 
Pro 


TGG 
Trp 


GAG 
Glu 


TCC 

Ser 
140 


GCC 
Ala 


AAC 

Asn 


TTC 

Phe 


CCC 

Gly 




TTC 

Phe 
145 


CCC 
Pro 


GAG 
Glu 


CAC 

His 


CTG 
Leu 


GAG 
Glu 
150 


CAC 
His 


GGC 

Gly 


AAA 

Lys 


CGC 
Arg 


TAC 

Tyr 
155 


GAG 
Glu 


CGG 
Arg 


GCC 

Ala 


GAG 
Glu 


GAG 
Glu 
160 


480 


TTC 

Phe 


ATC 

lie 


GAC 
Asp 


GTC 

val 


GTC 

val 
165 


AAA 
Lys 


AAA 
Lys 


CTG 
Leu 


TGG 

Trp 


GAC 
Asp 

170 


AGC 

ser 


GAC 
ASp 


GGC 

Gly 


CGC 

Arg 


CCC 

Pro 
175 


GTC 

val 


528 


Asp 


His 


Arg 


CCC 

Gly 
180 


ACC 

Thr 


C AC 

His 


TTC 

Phe 


C AC 

Glu 


GCC 

Ala 
185 


CCC 
Pro 


GGC 
Gly 


CCG 
Pro 


CTC 
Leu 


GGG 

Gly 
190 


ATC 

He 


CCC 

Ala 


j/O 


CGC 

Arg 


CCC 

Pro 


CCG 
Pro 
195 


CAG 

Gin 


GGC 
Gly 


CGC 

Arg 


CCC 

Pro 


GTC 
val 
200 


ATC 

He 


ATC 

He 


CAG 
Gin 


GCC 
Ala 


GGC 
Gly 
205 


TCC 

Ser 


TCG 

Ser 


CCG 

Pro 


624 


V3 1 Kl 

val 


Gly 
210 


CGC 

Arg 


C AC 

GAG 
Glu 


TTC 

Phe 


CCC 

Ala 


CCC 

Ala 
215 


CCC 

CGG 

Arg 


CAC 

His 


GCC 

Ala 


GAG 
Glu 


GTC 

val 
220 


ATC 
He 


TTC 

Phe 


ACC 

Thr 


CCC 

CGG 

Arg 


X>i C 


\J\\^ 
His 
225 


AAL. 

Asn 


Arg 


CTC 

Leu 


TCC 
i 

Ser 


C AC 

Asp 

230 


CCC 

Ala 


C AC 

Gin 


GAC 
Asp 


TTC 
Phe 


TAC 
Tyr 
235 


GGC 
Gly 


GAC 
Asp 


CTC 

Leu 


A A /" 

AAU 

Lys 


GCA 

Ala 
240 


i L\J 


Arg 


err 

val 


Ala 


CCC 

Arg 


C AC 

His 
245 


CCC 

Gly 


CCC 

Arg 


C AC 

Asp 


CCC 

Pro 


GAG 
Glu 
250 


AAG 

Lys 


GTC 
val 


CTC 

Leu 


GTG 

val 


TCC 

Trp 
255 


CCC 

Pro 


/Do 


ACC 

Thr 


CTC 

Leu 


GCG 

Ala 


CCG 

Pro 
260 


ATC 

He 


GTC 

val 


GCC 

Ala 


GCC 

Ala 


ACC 
Thr 
265 


GAC 
Asp 


ACC 

Thr 


GAG 
Glu 


GCG 
Ala 


AAG 

Lys 
270 


CAG 

Gin 


CGC 

Arg 


olb 


CTG 
Leu 


CAG 
Gin 


GAA 
Glu 
275 


CTG 
Leu 


CAG 

Gin 


GAC 
Asp 


CTC 

Leu 


ACC 

Thr 
280 


CAC 
His 


GAC 
Asp 


CAT 
His 


GTC 

val 


GCC 
Ala 
285 


CTG 

Leu 


CGC 
Arg 


ACC 

Thr 


864 


1 1 

Leu 


Gin 
290 


C AC 

ASp 


C AC 

His 


CTC 

Leu 


CCC 

Gly 


C AC 

oA^ 
Asp 

295 


CTC 

val 


GAC 
Asp 


CTG 
Leu 


AGC 

ser 


GCG 
Ala 
300 


TAC 
Tyr 


CCG 
Pro 


ATC 

ATC 

He 


C AC 

GAC 
Asp 




Gly 
305 


CCC 

Pro 


GTC 

val 


CCC 

Pro 


GAC 
Asp 


ATC 

He 
310 


CCG 

Pro 


TAC 

Tyr 


ACC 

Thr 


AAC 

Asn 


CAG 

Gin 
315 


TCC 

Ser 


CAG 

Gin 


TCG 

Ser 


ACG 

Thr 


ACC 

Thr 
320 


OCA 


GAG 
GlU 


CGCa 

Arg 


CTC 

Leu 


A 1 L 

He 


CCC 

Gly 
325 


C 1 G 

Leu 


f f 

GCC 

Ala 


ACC 

Arg 


Arg 


GAG 
Glu 
330 


AAC 
Asn 


CTC 

Leu 


Ser 


r\ 1 v» 

He 


CGC 

Arg 
335 


GAG 
Glu 


lUUo 


CTG 

Leu 


GCC 
Ala 


CTG 
Leu 


CGG 
Arg 
340 


CTG 
Leu 


ATG 

Met 


GGC 
Gly 


GAC 
Asp 


ATC 
He 
345 


GTC 
val 


GTC 

val 


GGC 
Gly 


ACA 
Thr 


CCG 
Pro 
350 


GAG 
Glu 


CAG 
Gin 


1056 


CTC 

Leu 


GCC 

Ala 


GAC 
Asp 
355 


CAC 
His 


ATG 

Met 


GAG 
Glu 


AGC 
ser 


TGG 

Trp 
360 


TTC 

Phe 


ACC GGC 
Thr Gly 

Page 


CGC 
Arg 

6 


GGC 
Gly 
365 


GCC 
Ala 


GAC 
ASp 


GGC 

Gly 


1104 
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TTC AAC ATC GAC TTC CCG TAG CTG CCG GGC TCC GCC GAG GAG TTC GTG 1152 

Phe Asn lie Asp Phe Pro Tyr Leu Pro Gly Ser Ala Asp Asp Phe val 
370 375 380 

GAC CAC GTG GTG CCC GAA CTG CAG CGC CGC GGC CTG TAG CGC TCG GGC 1200 

Asp His val Val Pro Glu Leu Gin Arg Arg Gly Leu Tyr Arg Ser Gly 
385 390 395 400 

TAG GAG GGC ACC ACC CTG CGG GCC AAC CTG GGC ATC GAC GCC CCC CGG 1248 
Tyr Glu Gly Thr Thr Leu Arg Ala Asn Leu Gly lie Asp Ala Pro Arg 
405 410 415 

AAG GCA GGT GCA GCG GCT TG 1268 
Lys Ala Gly Ala Ala Ala 
420 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: s . pri sti naespi ral i s 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .833 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG ACC GCG CCC ATC CTG GTC GCC ACC CTC GAC ACC CGC GGC CCC GCC 48 

Met Thr Ala Pro lie Leu val Ala Thr Leu Asp Thr Arg Gly Pro Ala 
15 10 15 

GCC ACC CTC GGC ACG ATC ACC CGC GCC GTG CGG GCC GCG GAG GCC GCC 96 

Ala Thr Leu Gly Thr lie Thr Arg Ala Val Arg Ala Ala Glu Ala Ala 
20 25 30 

GGA TTC GAC GCC GTC CTG ATC GAC GAC CGG GCC GCC GCC GGC GTC CAG 144 

Gly Phe Asp Ala val Leu lie Asp Asp Arg Ala Ala Ala Gly val Gin 
35 40 45 

GGC CGG TTC GAG ACG ACG ACG CTG ACC GCC GCG CTG GCC GCC GTC ACC 192 
Gly Arg Phe Glu Thr Thr Thr Leu Thr Ala Ala Leu Ala Ala val Thr 
50 55 60 

GAG CAC ATC GGC CTG ATC ACC GCC CCG CTC CCG GCC GAC CAG GCC CCC 240 
Glu His lie Gly Leu lie Thr Ala Pro Leu Pro Ala Asp Gin Ala Pro 
65 70 75 80 

TAC CAC GTG TCC CGG ATC ACC GCC TCG CTC GAC CAC CTC GCC CAC GGC 288 

jyr His val Ser Arg lie Thr Ala Ser Leu Asp His Leu Ala His Gly 
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85 90 95 

CGC ACC GGC TGG CTC GCG AGC ACG GAC ACC ACC GAC CCC GAG GGC CGC 336 
Arg Thr Gly Trp Leu Ala Ser Thr Asp Thr Thr Asp Pro Glu Gly Arg 
100 105 110 

ACC GGC GAA CTC ATC GAC GTC GTC CGC GGC CTG TGG GAC AGC TTC GAC 384 

Thr Gly Glu Leu lie Asp val val Arg Gly Leu Trp Asp Ser Phe Asp 
115 120 125 

GAC GAC GCC TTC GTC CAC GAC CGC GCC GAC GGC CTG TAC TGG CGG CTG 432 

Asp Asp Ala Phe Val His Asp Arg Ala Asp Gly Leu Tyr Trp Arg Leu 
130 135 140 

CCC GCC GTC CAC CAA CTC GAC CAC CAG GGC AGG CAC TTC GAC GTG GCC 480 

Pro Ala val His Gin Leu Asp His Gin Gly Arg His Phe Asp val Ala 
145 ^ 150 155 160 

GGC CCC CTC AAC GTC GCC CGC CCG CCG CAG GGC CAC CCC GTC GTC GCC 528 
Gly Pro Leu Asn val Ala Arg Pro Pro Gin Gly His Pro val val Ala 
165 170 175 

GTC ACC GGC CCC GCC CTC GCC GCG GCC GCC GAC CTC GTC CTG CTC GAC 576 

Val Thr Gly Pro Ala Leu Ala Ala Ala Ala Asp Leu val Leu Leu Asp 

180 185 190 

GAG GCG GCC GAC GCC GCC TCG GTG AAG CAG CAG GCA CCG CAC GCC AAG 624 
Glu Ala Ala Asp Ala Ala Ser Val Lys Gin Gin Ala Pro His Ala Lys 
195 200 205 

ATC CTC CTG CCG CTG CCC GGC CCG GCC GCC GAA CTG CCC GCC GAC AGC 672 
lie Leu Leu Pro Leu Pro Gly Pro Ala Ala Glu Leu Pro Ala Asp ser 
210 215 220 

CCC GCG GAC GGC TTC ACG GTG GCG CTC ACC GGC TCC GAC GAC CCG GTC 720 

Pro Ala Asp Gly Phe Thr Val Ala Leu Thr Gly Ser Asp Asp Pro val 
225 230 235 240 

CTG GCC GCG CTC GCC GCC CGG CCC GGC CGC CCG GAC CGC ACC GCG GCC 768 
Leu Ala Ala Leu Ala Ala Arg Pro Gly Arg Pro Asp Arg Thr Ala Ala 

245 250 255 

ACC ACC CTG CGC GAA CGC CTG GGC CTG GCC CGC CCC GAG AGC CGC CAC 816 
Thr Thr Leu Arg Glu Arg Leu Gly Leu Ala Arg Pro Glu Ser Arg His 
260 265 270 

GCC CTC ACC ACC GCC TG 833 

Ala Leu Thr Thr Ala 
275 

(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S . pri sti naespi ral i s 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1208 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
ATG TCC CGT CGC CTG TTC ACC TCG GAG TCC GTG ACC GAG GGC CAC CCC 48 

Met Ser Arg Arg Leu Phe Thr Ser Glu Sen val Thr Glu Gly His Pro 
15 10 15 

GAC AAG ATC GCC GAC CAG ATC AGT GAC ACC GTC CTC GAC GCC CTG CTG 96 

Asp Lys l1e Ala Asp Gln i1e Ser Asp Thr va1 Leu Asp Ala Leu Leu 
20 25 30 

CGC GAG GAC CCC GCC TCA CGC GTC GCG GTC GAG ACC CTG ATC ACC ACC 144 

Arg Glu Asp Pro Ala Ser Arg val Ala Val Glu Thr Leu lie Thr Thr 
35 40 45 

GGC CAG GTC CAC ATC GCC GGC GAG GTC ACC ACC AAG GCG TAC GCG CCC 192 
Gly Gin Val His lie Ala Gly Glu val Thr Thr Lys Ala Tyr Ala Pro 
50 55 60 

ATC GCC CAA CTG GTC CGC GAC ACG ATC CTG GCC ATC GGC TAC GAC TCG 240 
lie Ala Gin Leu val Arg Asp Thr lie Leu Ala lie Gly Tyr Asp Ser 
65 70 75 80 

TCC GCC AAG GGC TTC GAC GGC GCC TCC TGC GGC GTC TCC GTC TCC ATC 288 

Ser Ala Lys Gly Phe Asp Gly Ala Ser Cys Gly val Ser val Ser lie 
85 90 95 

GGC GCG CAG TCC CCG GAC ATC GCC CAG GGC GTC GAC AGC GCC TAC GAG 336 

Gly Ala Gin Ser Pro Asp lie Ala Gin Gly val Asp Ser Ala Tyr Glu 
100 105 110 

ACC CGC GTC GAG GGC GAG GAC GAC GAG CTC GAC CAG CAG GGC GCC GGC 384 
Thr Arg val Glu Gly Glu Asp Asp Glu Leu Asp Gin Gin Gly Ala Gly 
115 120 125 

GAC CAG GGC CTG ATG TTC GGC TAC GCC ACC GAC GAG ACC CCC TCG CTG 432 

Asp Gin Gly Leu Met Phe Gly Tyr Ala Thr Asp Glu Thr Pro Ser Leu 
130 135 140 

ATG CCG CTG CCC ATC GAG CTC GCC CAC CGC CTC TCG CGC CGG CTC ACC 480 
Met Pro Leu Pro lie Glu Leu Ala His Arg Leu Ser Arg Arg Leu Thr 
145 150 155 160 

GAG GTC CGC AAG GAC GGC ACC GTC CCC TAC CTG CGC CCC GAC GGC AAG 528 

Glu val Arg Lys Asp Gly Thr Val Pro Tyr Leu Arg Pro Asp Gly Lys 
165 170 175 

ACC CAG GTC ACC ATC GAG TAC CAG GGC AGC CGC CCG GTG CGC CTG GAC 576 
Thr Gin Val Thr lie Glu Tyr Gin Gly Ser Arg Pro val Arg Leu Asp 
180 185 190 

ACC GTC GTC GTC TCC TCC CAG CAC GCC GCC GAC ATC GAC CTC GGC TCC 624 

Thr val val val ser ser Gin His Ala Ala Asp lie Asp Leu Gly Ser 
195 200 205 
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CTG CTC ACC CCC GAC ATC CGC GAG CAC GTC GTC GAG CAC GTC CTC GCC 672 

Leu Leu Thr Pro Asp lie Arg Glu His val val Glu His val Leu Ala 

210 215 220 

GCA CTC GCC GAG GAC GGC ATC AAG CTC GAG ACG GAC AAC TAC CGC CTG 720 

Ala Leu Ala Glu Asp Gly lie Lys Leu Glu Thr Asp Asn Tyr Arg Leu 

225 230 235 240 

CTG GTC AAC CCG ACC GGC CGT TTC GAG ATC GGC GGC CCG ATG GGC GAC 768 

Leu val Asn Pro Thr Gly Arg Phe Glu lie Gly Gly Pro Met Gly Asp 
245 250 255 

GCC GGC CTG ACC GGC CGC AAG ATC ATC ATC GAC ACG TAC GGC GGC ATG 816 

Ala Gly Leu Thr Gly Arg Lys lie lie lie Asp Thr Tyr Gly Gly Met 
260 265 270 

GCC CGC CAC GGC GGT GGC GCG TTC TCC GGC AAG GAC CCG TCC AAG GTC 864 

Ala Arg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys val 
275 280 285 

GAC CGT TCC GCC GCG TAC GCG ATG CGC TGG GTC GCC AAG AAC GTC GTC 912 

Asp Arg Ser Ala Ala Tyr Ala Met Arg Trp Val Ala Lys Asn val val 
290 295 300 

GCC GCG GGC CTC GCC TCC CGC TGC GAG GTC CAG GTC GCC TAC GCC ATC 960 

Ala Ala Gly Leu Ala Ser Arg Cys Glu Val Gin val Ala Tyr Ala lie 

305 310 315 320 

GGC AAG GCC GAG CCG GTC GGC CTG TTC GTC GAG ACG TTC GGC ACC GGC 1008 

Gly Lys Ala Glu Pro val Gly Leu Phe val Glu Thr Phe Gly Thr Gly 
325 330 335 

ACC GTC GCC CAG GAG CGC ATC GAG AAG GCC ATC ACC GAG GTC^TTC GAC 1056 

Thr val Ala Gin Glu Arg lie Glu Lys Ala lie Thr Glu val Phe Asp 
340 345 350 

CTG CGC CCC GCG GCC ATC ATC CGC GAC CTC GAC CTG CTG CGG CCC ATC 1104 

Leu Arg Pro Ala Ala lie lie Arg Asp Leu Asp Leu Leu Arg Pro lie 
355 360 365 

TAC GCC GCC ACC GCC GCC TAC GGC CAC TTC GGC CGC GAA CTG CCC GAC 1152 

Tyr Ala Ala Thr Ala Ala Tyr Gly His Phe Gly Arg Glu Leu Pro Asp 
370 375 380 

TTC ACC TGG GAG CGG ACC GAC CGC GCC CAC CGG CTC AAG GCC GCG GCC 1200 
Phe Thr Trp Glu Arg Thr Asp Arg Ala His Arg Leu Lys Ala Ala Ala 

385 390 395 400 

GGT CTC TG 1208 

Gly Leu 



(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1879 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE type: CDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S.pH sti naespi rail s 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 110.. 18 58 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GATCGGCTCC TGACGGAGCG GCG6CGCGCG GGCGCGGCGC ATCAGCGGCG TGTCAACGGC 60 

GCTGCCGACA CTGGGCGCGA CGC6AGGACG AAGCCGGAAA GGACCAACG ATG CTG 115 

Met Leu 
1 



GAC 
Asp 


GGA 

Gly 


TGC 

cys 
5 


GTT 

val 


ccc 
Pro 


TGG 
Trp 


ccc 
Pro 


GAG 
Glu 
10 


GAT 
Asp 


GTG 

val 


GCC 

Ala 


GCG 

Ala 


AAG 

Lys 
15 


TAC 

Tyr 


CGG 

Arg 


GCG 
Ala 


163 


GCC 

Ala 


GGC 

Gly 
20 


TAC 

Tyr 


TGG 
Trp 


CGG 
Arg 


GGC 

Gly 


GAG 
Glu 
25 


CCG 
Pro 


CTG 
Leu 


GGC 
Gly 


ATG 

Met 


CTG 

Leu 
30 


CTG 
Leu 


GGC 
Gly 


CGC 
Arg 


TGG 
Trp 


211 


GCG 

Ala 
35 


GAG 
Glu 


CAG 
Gin 


TAC 
Tyr 


GGC 
Gly 


GAG 
Glu 
40 


CGG 
Arg 


GAG 
Glu 


GCG 
Ala 


CTG 

Leu 


GTC 

val 
45 


GGC 
Gly 


GCG 

Ala 


GAC 
Asp 


GGG 
Gly 


TGC 
cys 
50 


259 


TCC 
Ser 


CGT 
Arg 


GTC 

val 


ACC 
Thr 


TAC 

Tyr 
55 


CGT 
Arg 


GCC 

Ala 


CTG 

Leu 


GAC 
Asp 


CGC 
Arg 
60 


TGG 
Trp 


TGC 

Cys 


GAC 
Asp 


CGG 

Arg 


CTG 

Leu 
65 


GCG 

Ala 


307 


GCG 

Ala 


GGG 

Gly 


TTC 

Phe 


GCG 

Ala 
70 


GCG 

Ala 


CGC 
Arg 


GGG 

Gly 


ATC 

He 


GGC 

Gly 
75 


GCC 

Ala 


GGC 

Gly 


GAG 
Glu 


CGG 
Arg 


GTG 

val 
80 


CTG 
Leu 


GTG 

val 


355 


CAG 
Gin 


CTG 

Leu 


CCG 
Pro 
85 


AAC 

Asn 


ACG 
Thr 


ccc 
Pro 


GAG 
Glu 


TTC 
Phe 
90 


GTC 
val 


GCG 
Ala 


GTG 

val 


TGC 

cys 


TTC 

Phe 
95 


GCG 
Ala 


CTG 
Leu 


TTC 

Phe 


403 


CGT 

Arg 


CTG 

Leu 
100 


GGC 

Gly 


GCG 
Ala 


CTG 
Leu 


CCG 
Pro 


GTG 

val 
105 


TTC 

Phe 


GCG 
Ala 


CTG 

Leu 


ccc 
Pro 


GCG 

Ala 
110 


CAC 
His 


CGT 
Arg 


GCC 
Ala 


GCC 

Ala 


451 


GAG 
Glu 
115 


GTG 

val 


GGG 
Gly 


CAC 

His 


CTG 

Leu 


CTC 

Leu 
120 


GAG 
Glu 


CTG 

Leu 


TCC 

Ser 


GGC 
Gly 


GCC 

Ala 
125 


GTC 

val 


GCC 
Ala 


CAC 
His 


ATC 

He 


CTG 

Leu 
130 


499 


CCG 

Pro 


GGC 
Gly 


ACC 

Thr 


GGC 

Gly 


ACC 

Thr 
135 


GGC 

Gly 


TAC 
Tyr 


GAC 
Asp 


CAT 

His 


GTC 

val 
140 


GCG 

Ala 


GCG 
Ala 


GCC 
Ala 


GTG 

val 


GAG 
Glu 
145 


GCC 

Ala 


547 


CGT 

Arg 


GCC 
Ala 


CGC 
Arg 


CGT 
Arg 
150 


GCC 

Ala 


CGC 
Arg 


CCG 
Pro 


GTG 
val 


CAG 
Gin 
155 


GTG 
val 


TTC 

Phe 


GTG 

val 


GCG 
Ala 


GGC 
Gly 
160 


GAG 
Glu 


GCG 

Ala 


595 


ccc 
Pro 


GCG 

Ala 


GTG 

val 
165 


CTG 

Leu 


ccc 
Pro 


GAG 
Glu 


GGG 
Gly 


TTC 

Phe 
170 


ACC 

Thr 


GCG CTG 

Ala Leu 
Page 


GCC 
Ala 

11 


GAC 
Asp 
175 


GTG 

val 


GAC 
Asp 


GGC 
Gly 


643 
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GAC CCG GTG GCG CCG GCG GAC GTG GAC GCC TTC CGA CGT GGC GTC TTC 691 
Asp Pro val Ala Pro Ala Asp Val Asp Ala Phe Arg Arg Gly val Phe 
180 185 190 

CTG CTG TCC GGG GGG ACG ACC GCG CTG CCG AAG CTG ATC CCG CGC ACC 739 

Leu Leu Ser Gly Gly Thr Thr Ala Leu Pro Lys Leu lie Pro Arg Thr 
195 200 205 210 

CAC GAC GAC TAC GCC TAC CAG TGC CGG GTC ACG GCC GGT ATC TGC GGC 787 
His Asp Asp Tyr Ala Tyr Gin Cys Arg val Thr Ala Gly lie cys Gly 

215 220 225 

CTG GAC GCG GAC AGT GTC TAT CTG GCG GTG CTG CCG GCC GAG TTC AAC 835 

Leu Asp Ala Asp Ser val Tyr Leu Ala val Leu Pro Ala Glu Phe Asn 
230 235 240 

TTC CCC TTC GGC TGC CCG GGC ATC CTG GGC ACC CTG CAC GCC GGC GGG 883 

Phe pro Phe Gly Cys Pro Gly lie Leu Gly Thr Leu His Ala Gly Gly 
245 250 255 

CGG GTG GTG TTC GCG CTG TCA CCG CAG CCC GAG GAG TGC TTC GCG CTG 931 

Arg Val val Phe Ala Leu ser Pro Gin Pro Glu Glu Cys Phe Ala Leu 
260 265 270 

ATC GAA CGC GAA CAC GTC ACC TTC ACC TCC GTC ATC CCC ACG ATC GTG 979 

lie Glu Arg Glu His Val Thr Phe Thr Ser Val lie Pro Thr lie Val 
275 280 285 290 

CAC CTG TGG CTG GCG GCC GCC GCA CAA GGC CAC GGC CGC GAC CTG GGC 1027 

His Leu Trp Leu Ala Ala Ala Ala Gin Gly His Gly Arg Asp Leu Gly 

295 300 305 

AGC CTT CAG CTG CTG CAG GTC GGC AGC GCC AAA CTC CAC GAG GAG CTC 1075 

Ser Leu Gin Leu Leu Gin val Gly Ser Ala Lys Leu His Glu Glu Leu 
310 315 320 

GCC GCC CGG ATC GGC CCC GAA CTG GGG GTG CGG CTG CAG CAG GTG TTC 1123 
Ala Ala Arg lie Gly Pro Glu Leu Gly Val Arg Leu Gin Gin Val Phe 
325 330 335 

GGC ATG GCC GAG GGA CTG CTG ACC TTC ACC CGC GAC GAC GAC CCG GCG 1171 
Gly Met Ala Glu Gly Leu Leu Thr Phe Thr Arg Asp Asp Asp Pro Ala 
340 345 350 

GAC GTG GTG CTG CGC ACC CAG GGC CGG CCG GTG TCC GAG GCC GAC GAG 1219 

Asp val Val Leu Arg Thr Gin Gly Arg Pro val Ser Glu Ala Asp Glu 
355 360 365 370 

ATA CGC GTC GCC GAC CCC GAC GGC CGG CCC GTG CCC CGC GGT GAG ACC 1267 

He Arg val Ala Asp Pro Asp Gly Arg Pro val Pro Arg Gly Glu Thr 
375 380 385 

GGT GAA CTG CTC ACC CGC GGC CCC TAC ACG CTG CGC GGC TAC TAC CGG 1315 
Gly Glu Leu Leu Thr Arg Gly Pro Tyr Thr Leu Arg Gly Tyr Tyr Arg 
390 395 400 

GCC CCC GAG CAC AAC GCC CGC GCG TTC ACC GAG GAC GGC TTC TAC CGC 1363 

Ala Pro Glu His Asn Ala Arg Ala Phe Thr Glu Asp Gly Phe Tyr Arg 
405 410 415 

AGC GGC GAT CTG GTG CGG CTC ACC GCC GAC GGG CAG TTG GTG GTG GAG 1411 

Ser Gly Asp Leu val Arg Leu Thr Ala Asp Gly Gin Leu val val Glu 
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420 425 430 

GGC AGG ATC AAG GAC GTC GTC ATC CGC GGC GGC GAC AAG GTC TCC GCG 1459 

Gly Arg lie Lys Asp val val lie Arg Gly Gly Asp Lys Val Ser Ala 
435 440 445 450 

ACC GAG GTC GAG GGC CAC CTG GGC GCC CAC CCC GAC GTC CAG CAG GCC 1507 
Thr Glu val Glu Gly His Leu Gly Ala His Pro Asp Val Gin Gin Ala 
455 460 465 

GCC GTC GTC GCC ATG CCC GAC CCG GTG TGG GGC GAG AAG GTC TGC GCC 1555 

Ala val val Ala Met Pro Asp Pro val Trp Gly Glu Lys val Cys Ala 
470 475 480 

TAC ATC GTG CCC GCA CCC GGC CGT CCC GCA CCG CCG ATG GCG GCG CTG 1603 

Tyr lie Val Pro Ala Pro Gly Arg Pro Ala Pro Pro Met Ala Ala Leu 
485 490 495 

CGC CGG CTG CTG CGC GCG CGG GGA CTG GCC GAC TAC AAG CTT CCC GAC 1651 
Arg Arg Leu Leu Arg Ala Arg Gly Leu Ala Asp Tyr Lys Leu Pro Asp 

500 505 510 

CGG GTG GAG GTC GTC GAC GCG TTC CCG CTG ACC GGC CTC AAC AAG GTC 1699 

Arg Val Glu Val Val Asp Ala Phe Pro Leu Thr Gly Leu Asn Lys val 

515 520 525 530 

GAC AAG AAG GCC CTG GCG GCC GAC ATC GCC GCC AAG ACC GCC CCC ACC 1747 

Asp Lys Lys Ala Leu Ala Ala Asp lie Ala Ala Lys Thr Ala Pro Thr 
535 540 545 

CGC CCC ACC ACC GCC GGC CAC GGC CCG ACC ACG GAC GGC GAT ACG GCC 1795 

Arg Pro Thr Thr Ala Gly His Gly Pro Thr Thr Asp Gly Asp Thr Ala 
550 555 560 

GGT GGG GGT GGG TCC GCG GGC GGG GTG ACG GCC GCC GGT GGC GGG CGG 1843 
Gly Gly Gly Gly Ser Ala Gly Gly val Thr Ala Ala Gly Gly Gly Arg 
565 570 575 

GAG GAG GCG GCG TGAGCGGGCC CGGGCCCGAG GGCG 1879 
Glu Glu Ala Ala 
580 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1833 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: s . pri sti naespi ral i s 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION : 103 . . 1689 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GGATCCCCTC GCCCAGGGCC CTGGCGGGCC CGCCGGGCCG TGGGGGAGGT GCGGGGGCCG 60 

CGGGCCCCGG CACCGCACGA ACAGAACAAC CGCTCCGGGC CC ATG CGG ACT TCA 114 

Met Arg Thr Ser 

CGG TCC CAC GAC CAG CGG GCC CCT ACC CCC TGG AGA CAT CCC TTG CAC 162 
Arg ser His Asp Gin Arg Ala Pro Thr Pro Trp Arg His Pro Leu His 
5 10 15 20 

AGC ACC CGG CCC GCG CCC GCG GCC GAC CGT GAC CCC AGG CGC TGG GTC 210 

Ser Thr Arg Pro Ala Pro Ala Ala Asp Arg Asp Pro Arg Arg Trp val 

25 30 35 

ATC CTC GGC GTG ATC TGC CTG GCC CAA CTC GTC GTC CTG CTC GAC AAC 258 
lie Leu Gly val lie Cys Leu Ala Gin Leu Val Val Leu Leu Asp Asn 
40 45 50 

ACC GTC CTC AAC GTC GCC ATC CCG GTG CTC ACC ACC GAC CTG GGC GCC 306 
Thr Val Leu Asn Val Ala lie Pro Val Leu Thr Thr Asp Leu Gly Ala 
55 60 65 

AGC ACC GCC GAC ATC CAG TGG ATG ATC AAC GCC TAC GCG CTC GTG CAG 354 

Ser Thr Ala Asp He Gin Trp Met lie Asn Ala Tyr Ala Leu val Gin 
70 75 80 

TCC GGG CTG CTG CTC ACC GCG GGC AGC CTC GCG GAC CGC TAC GGC CGC 402 
Ser Gly Leu Leu Leu Thr Ala Gly Ser Leu Ala Asp Arg Tyr Gly Arg 
85 90 95 100 

AAA CGG CTG CTG ATG CTC GGA CTG GTG CTC TTC GGC GCC GGG TCC GCC 450 

Lys Arg Leu Leu Met Leu Gly Leu Val Leu Phe Gly Ala Gly Ser Ala 
105 110 115 

TGG GCG GCC TTC GCC CAG GAC TCC GCC CAA CTC ATC GCC GCC CGG GCC 498 
Trp Ala Ala Phe Ala Gin Asp Ser Ala Gin Leu lie Ala Ala Arg Ala 
120 125 130 

GGC ATG GGC GTG GGC GGG GCG CTG CTG GCG ACC ACC ACC CTC GCC GTC 546 
Gly Met Gly val Gly Gly Ala Leu Leu Ala Thr Thr Thr Leu Ala Val 
135 140 145 

ATC ATG CAG GTC TTC GAC GAC GAC GAA CGC CCC CGG GCG ATC GGC CTG 594 

lie Met Gin val Phe Asp Asp Asp Glu Arg Pro Arg Ala lie Gly Leu 
150 155 160 

TGG GGA GCG GCC AGC TCA CTG GGC TTC GCG GCC GGC CCG CTG CTC GGC 642 

Trp Gly Ala Ala ser Ser Leu Gly Phe Ala Ala Gly Pro Leu Leu Gly 
165 170 175 180 

GGC GCC CTC CTC GAC CAC TTC TGG TGG GGC TCC ATC TTC CTG ATC AAC 690 

Gly Ala Leu Leu Asp His Phe Trp Trp Gly Ser lie Phe Leu lie Asn 
185 190 195 

CTG CCC GTC GCG CTG CTG GGC CTG CTG GCC GTC GCC CGC CTG GTG CCC 738 
Leu Pro val Ala Leu Leu Gly Leu Leu Ala val Ala Arg Leu val Pro 
200 205 210 

GAG ACG AAG AAC CCC GAA GGC CGG CGC CCC GAC CTG CTC GGC GCC GTG 786 
Glu Thr Lys Asn Pro Glu Gly Arg Arg Pro Asp Leu Leu Gly Ala Val 
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215 220 225 

CTC TCC ACC CTC GGC ATG GTC GGC GTC GTC TAG GCC ATC ATC TCC GGC 834 

Leu ser Thr Leu Gly Met val Gly val val Tyr Ala He lie Ser Gly 
230 235 240 

CCC GAA CAC GGC TGG ACG GCC CCG CAG GTC CTC CTG CCG GCC GCC GTC 882 
Pro Glu His Gly Trp Thr Ala Pro Gin Val Leu Leu Pro Ala Ala val 
245 250 255 260 

GCG GCC GCC GCG CTC ACC GCG TTC GTC CGC TGG GAA CTG CAC ACC CCC 930 
Ala Ala Ala Ala Leu Thr Ala Phe Val Arg Trp Glu Leu His Thr Pro 
265 270 275 

CAC CCC ATG CTC GAC ATG GGC TTC TTC ACC 6AC CGG CGC TTC AAC GGG 978 

His Pro Met Leu Asp Met Gly Phe Phe Thr Asp Arg Arg Phe Asn Gly 
280 285 290 

CCG TCG CCG GCG GAG TGC TCG TCG TTC GGC ATG GCC GGC TCG CTC TTC 1026 
Pro ser Pro Ala Glu Cys Ser ser Phe Gly Met Ala Gly Ser Leu Phe 

295 300 305 

CTG CTC ACC CAG CAC CTC CAA CTC GTC CTC GGC TAC GAC GCC CTG CAG 1074 

Leu Leu Thr Gin His Leu Gin Leu Val Leu Gly Tyr Asp Ala Leu Gin 
310 315 320 

GCC GGC CTG CGC ACC GCG CCA CTG GCT TTG ACG ATC GTC GCC CTC AAC 1122 
Ala Gly Leu Arg Thr Ala Pro Leu Ala Leu Thr lie Val Ala Leu Asn 
325 330 335 340 

CTG GCC GGC CTC GGC GCG AAA CTC CTC GCC GCG CTC GGC ACC GCC CGC 1170 

Leu Ala Gly Leu Gly Ala Lys Leu Leu Ala Ala Leu Gly Thr Ala Arg 
345 350 355 

AGC ATC GCC CTG GGC ATG ACA CTG CTG GCC GCC GGC CTC AGC GCG GTG 1218 

ser lie Ala Leu Gly Met Thr Leu Leu Ala Ala Gly Leu Ser Ala val 
360 365 370 

GCC GTC GGC GGA TCG GGC CCC GAC GCC GGC TAC GGC GGC ATG CTC GCC 1266 
Ala Val Gly Gly Ser Gly Pro Asp Ala Gly Tyr Gly Gly Met Leu Ala 
375 380 385 

GGC CTG CTC CTC ATG GGC GCG GGC ATC GCA CTG GCC ATG CCC GCC ATG 1314 

Gly Leu Leu Leu Met Gly Ala Gly lie Ala Leu Ala Met Pro Ala Met 
390 395 400 

GCC ACC GCC GTG ATG TCC TCC ATC CCG CCC GCC AAG GCC GGG GCC GGA 1362 
Ala Thr Ala val Met Ser ser lie Pro Pro Ala Lys Ala Gly Ala Gly 
405 410 415 420 

GCG GGC GTG CAG GGC ACC CTG ACC GAG TTC GGC GGC GGA CTG GGA GTG 1410 

Ala Gly Val Gin Gly Thr Leu Thr Glu Phe Gly Gly Gly Leu Gly val 
425 430 435 

GCG ATC CTC GGC GCC GTC CTC GGC TCC CGC TTC GCC TCC CAA CTG CCC 1458 

Ala lie Leu Gly Ala val Leu Gly Ser Arg Phe Ala Ser Gin Leu Pro 
440 445 450 

GCC GCC ATC ACC GGC ACC GGC TCC CTC GAC GAG GCA CTG CGC GAC GCC 1506 

Ala Ala He Thr Gly Thr Gly Ser Leu Asp Glu Ala Leu Arg Asp Ala 
455 460 465 

ACA CCC CAA CAG GCC GGG CAG GTC CAC GAC GCG TTC GCC GAC GCG GTG 1554 
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Thr Pro Gin Gin Ala Gly Gin val His Asp Ala Phe Ala Asp Ala val 
470 475 480 

AAC ACC AGC CAA CTC ATC GGC GCC GCC GCC GTG TTC ACC GGC GGC CTG 1602 

Asn Thr Ser Gin Leu lie Gly Ala Ala Ala val Phe Thr Gly Gly Leu 
485 490 495 500 

CTC GCC GCG CTG CTG CTG CAC CGC GCC GAC CGC AAG GCC GCC CCC CAG 1650 
Leu Ala Ala Leu Leu Leu His Arg Ala Asp Arg Lys Ala Ala Pro Gin 
505 510 515 

CCC ACC GCC CCC ACC CCC GAA CCC ACC ACC ACC GCC TGACCCCCGG 1696 

Pro Thr Ala Pro Thr Pro Glu Pro Thr Thr Thr Ala 

520 525 

CCCGCCGGGC ACCACACAAC CCACGGCCCC ACCCCTGCGG CTCCCCACCG GGACCCACAG 1756 
GGGCGGGGCC GTGCCGCTGC CCTGCCCACA CACACAGCCC CCACACACAC AGCCCCCGCA 1816 
CGGCCGACAG CGCCGGG 1833 

(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS:' double 

(D) topology: linear 

(ii) MOLECULE type: CDNA 

(iii) hypothetical: no 

(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S - pri sti naespi ral i s 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 212.. 695 

(D) OTHER INFORMATION: /product= "Gene snac" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CTCGAGCCGC GCCCCCAGGT GCTGGTGTCG CTCGCCGTGG AGAAGGGCGC CGACGGCACC 60 

GCGCCGCCGG ACCGGCTGCT GATCCACGAC GGCTTCCCCT GGGGCCGCGC CGCCCCGCGC 120 

GAAGCGGAGC TGCCCACCGG GCACCGCGCC CTGCCGGCCC TGGCCGGCGC CGCCCGCTGA 180 

GGCGCGGCAA CCACCAACAG AAGGAGCCCC C GTG ACA GGA GCC GAC GAC CCG 232 

val Thr Gly Ala Asp Asp Pro 

GCA AGG CCC GCG GTC GGC CCG CAG AGT TTC CGA GAC GCG ATG GCG CAG 280 
Ala Arg Pro Ala Val Gly Pro Gin Ser Phe Arg Asp Ala Met Ala Gin 
10 15 20 

CTG GCG TCG CCC GTC ACC GTC GTA ACC GTC CTC GAC GCG GCC GGA CGC 328 

Leu Ala Ser Pro val Thr val val Thr val Leu Asp Ala Ala Gly Arg 
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25 30 35 

CGC CAC GGC TTC ACG GCC GGC TCG GTG GTC TCT GTG TCG CTG GAC CCG 376 

Arq His Gly Phe Thr Ala Gly Ser Val Val Ser va] Ser Leu Asp Pro 
40 45 50 55 

CCG CTG GTG ATG GTC GGC ATC GCG CTC ACC TCC AGC TGC CAC ACG GCG 424 
Pro Leu val Met val Gly lie Ala Leu Thr ser ser Cys His Thr Ala 
60 65 70 

ATG GCC GCC GCC GCC GAG TTC TGC GTC AGC ATC CTC GGC GAG GAC CAG 472 

Met Ala Ala Ala Ala Glu Phe Cys val ser lie Leu Gly Glu Asp Gin 
75 80 85 

CGC GCC GTC GCG AAG CGG TGC GCG ACG CAC GGC GCC GAC CGG TTC GCG 520 

Arg Ala val Ala Lys Arg cys Ala Thr His Gly Ala Asp Arg Phe Ala 
90 95 100 

GGC GGC GAG TTC GCC GCC TGG GAC GGT ACG GGG GTG CCC TAC CTG CCG 568 
Gly Gly Glu Phe Ala Ala Trp Asp Gly Thr Gly val Pro Tyr Leu Pro 
105 110 115 

GAC GCC AAG GTC GTC CTG CGC TGC CGC ACC ACG GAC GTG GTG CGC GCC 616 

Asp Ala Lys val val Leu Arg Cys Arg Thr Thr Asp val Val Arg Ala 
120 125 130 135 

GGC GAC CAC GAC CTG GTG CTC GGC ACG CCC GTG GAG ATC CGC ACG GGC 664 

Gly Asp His Asp Leu val Leu Gly Thr Pro val Glu lie Arg Thr Gly 
140 145 150 

GAC CCG GCG AAG CCA CCC CTG CTG TGG TAC C 695 
Asp Pro Ala Lys Pro Pro Leu Leu Trp Tyr 
155 160 



(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S. pri sti naespi rali s 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..640 

(D) OTHER INFORMATION: /product= "gene SnaD" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GCG ACC GCC CGG CTC ATC GGC CCG CTG CCG CGC CGG CTG GGC CTC CAG 48 

Ala Thr Ala Arg Leu lie Gly Pro Leu Pro Arg Arg Leu Gly Leu Gin 
15 10 15 

Page 17 



03806.0054-04. seql st 

GTG CAC CAG GTG ATG ACG GGC GCG TTC GCG CAG GCC CTC GCC CGC TGG 96 

val His Gin val Met Thr Gly Ala Phe Ala Gin Ala Leu Ala Arg Trp 
20 25 30 

CGG GGC AGC CGC GCC GTC ACC TTC GAC GTG GAG ACC CAC GGA CGG CAC 144 

Arg Gly Ser Arg Ala Val Thr Phe Asp val Glu Thr His Gly Arg His 
35 40 45 

GGC CGC GAC GAA CTG TTC CGT ACC GTC GGC TGG TTC ACC TCC ATC CAC 192 
Gly Arg Asp Glu Leu Phe Arg Thr val Gly Trp Phe Thr Ser lie His 
50 55 60 

CCC GTC GTC CTG GGC GCG GAC CGC TCC GTG CAC CCC GAG CAG TAC CTC 240 
Pro Val Val Leu Gly Ala Asp Arg ser Val His Pro Glu Gin Tyr Leu 
65 70 75 80 

GCC CAG ATC GGC GCG GCG CTG ACC GCC GTA CCG GAC GGC GGC GTC GGC 288 

Ala Gin lie Gly Ala Ala Leu Thr Ala Ala Pro Asp Gly Gly Val Gly 
85 90 95 

TTC GGC GCC TGC CGC GAG TTC TCC CCG GAC GCC GGG CTG CGC ACT CTG 336 
Phe Gly Ala Cys Arg Glu Phe Ser Pro Asp Ala Gly Leu Arg Thr Leu 
100 105 110 

CTG CGT GAC CTG CCG CCC GCC CTG GTG TGC TTC AAC TAC TAC GGT CAG 384 

Leu Arg Asp Leu Pro Pro Ala Leu Val Cys Phe Asn Tyr Tyr Gly Gin 
115 120 125 

GCC GAC CAG TTG AGC CCG AAC GGC GGT TTC CGT ATG TCG GGC CGT CCC 432 

Ala Asp Gin Leu Ser Pro Asn Gly Gly Phe Arg Met Ser Gly Arg Pro 
130 135 140 

ATC CCG CGC GAG CAC TCC GCC CGC TGC GAG CGC GTC TAC GGC ATC GAG 480 

lie Pro Arg Glu His Ser Ala Arg Cys Glu Arg Val Tyr Gly lie Glu 
145 150 155 160 

GTG TAC GGC ATC GTC CAC GGC GGC CGC CTG CGC ATG GGC CTG ACC TGG 528 
val Tyr Gly lie Val His Gly Gly Arg Leu Arg Met Gly Leu Thr Trp 
165 170 175 

GTG CCG AGC CCG GCG GAC GGT GTG GAC GAG GCC GGC GTC GAC GCG CTC 576 
Val Pro Ser Pro Ala Asp Gly val Asp Glu Ala Gly val Asp Ala Leu 
180 185 190 

GTG GAG CAG ATG AGC TGG GTG CTG GCC ACG CTC GCG GGC GCC GAC CCG 624 

val Glu Gin Met ser Trp val Leu Ala Thr Leu Ala Gly Ala Asp Pro 
195 200 205 

CAC GCC GTG ACC CCG G 640 
His Ala val Thr Pro 
210 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 645 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S . pri sti naespi ral i s 

(ix) FEATURE: 

(A) name/key: CDS 

(B) location: 61. .645 

(D) OTHER information: /product= "gene papA" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGCGTCAAGA ACCTGCCGCT GACCGTACGG CGCGGCTGAC ACAGACAAGG GGGCCACCTG 60 
GTG CGC ACC GTG CGA ACC CTG CTG ATC GAC AAC TAC GAC TCG TTC ACC 108 

val Arg Thr Val Arg Thr Leu Leu lie Asp Asn Tyr Asp ser Phe Thr 
15 10 15 

TAC AAC CTC TTC CAG ATG CTG GCC GAG GTG AAC GGC GCC GCT CCG CTC 156 

Tyr Asn Leu Phe Gin Met Leu Ala Glu val Asn Gly Ala Ala Pro Leu 
20 25 30 

GTC GTC CGC AAC GAC GAC ACC CGC ACC TGG CAG GCC CTG GCG CCG GGC 204 

Val val Arg Asn Asp Asp Thr Arg Thr Trp Gin Ala Leu Ala Pro Gly 
35 40 45 

GAC TTC GAC AAC GTC GTC GTC TCA CCC GGC CCC GGC CAC CCC GCC ACC 252 
Asp Phe Asp Asn Val Val val ser Pro Gly Pro Gly His Pro Ala Thr 
50 55 60 

GAC ACC GAC CTG GGC CTC AGC CGC CGG GTG ATC ACC GAA TGG GAC CTG 300 

Asp Thr Asp Leu Gly Leu Ser Arg Arg val lie Thr Glu Trp Asp Leu 
65 70 75 80 

CCG CTG CTC GGG GTG TGC CTG GGC CAC CAG GCC CTG TGC CTG CTC GCC 348 
Pro Leu Leu Gly val Cys Leu Gly His Gin Ala Leu Cys Leu Leu Ala 
85 90 95 

GGC GCC GCC GTC GTC CAC GCA CCC GAA CCC TTT CAC GGC CGC ACC AGC 396 

Gly Ala Ala val Val His Ala Pro Glu Pro Phe His Gly Arg Thr ser 
100 105 110 

GAC ATC CGC CAC GAC GGG CAG GGC CTG TTC GCG AAC ATC CCC TCC CCG 444 
Asp lie Arg His Asp Gly Gin Gly Leu Phe Ala Asn lie Pro Ser Pro 
115 120 125 

CTG ACC GTG GTC CGC TAC CAC TCG CTG ACC GTC CGG CAA CTG CCC GCC 492 
Leu Thr val val Arg Tyr His Ser Leu Thr Val Arg Gin Leu Pro Ala 
130 135 140 

GAC CTG CGC GCC ACC GCC CAC ACC GCC GAC GGG CAG CTG ATG GCC GTC 540 
Asp Leu Arg Ala Thr Ala His Thr Ala Asp Gly Gin Leu Met Ala Val 
145 150 155 160 

GCC CAC CGC CAC CTG CCC CGC TTC GGC GTG CAG TTC CAC CCC GAA TCG 588 
Ala His Arg His Leu Pro Arg Phe Gly val Gin Phe His Pro Glu ser 
165 170 175 

ATC AGC AGC GAA CAC GGC CAC CGG ATG CTC GCC AAC TTC CGC GAC CTG 636 
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lie Sen Ser Glu His Gly His Arg Met Leu Ala Asn Phe Arg Asp Leu 
180 185 190 

TCC CTG CGC 645 
Ser Leu Arg 
195 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S. pri sti naespi ral i s 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 84.. 962 

(D) OTHER INFORMATION: /produCt= "Gene PapM" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTCGAGGACG AGTGGATCGC CTCCGGCGGC GCCCCCGTCC CCACGCCCGT GCACGCGTCC 60 

GCGTCCGCGC GGGGGGCCGT GTC GTG ACC GCC GCC GCA CCC ACC CTC GCC 110 

val Thr Ala Ala Ala Pro Thr Leu Ala 
1 5 

CAG GCG CTG GAC GAG GCC ACC GGG CAG CTG ACC GGC GCC GGG ATC ACC 158 

Gin Ala Leu Asp Glu Ala Thr Gly Gin Leu Thr Gly Ala Gly lie Thr 
10 15 20 25 

GCC GAC GCC GCC CGG GCC GAC ACC CGG CTG CTG GCC GCC CAC GCC TGC 206 
Ala Asp Ala Ala Arg Ala Asp Thr Arg Leu Leu Ala Ala His Ala Cys 
30 35 40 

CAG GTC GCC CCG GGG GAC CTC GAC ACC TGC CTG GCC GGC CCG GTG CCG 254 

Gin Val Ala Pro Gly Asp Leu Asp Thr Cys Leu Ala Gly Pro val Pro 
45 50 55 

CCC CGG TTC TGG CAC TAC GTC CGG CGC CGT CTG ACC CGC GAA CCC GCC 302 

Pro Arg Phe Trp His Tyr Val Arg Arg Arg Leu Thr Arg Glu Pro Ala 
60 65 70 

GAA CGC ATC GTC GGC CAC GCC TAC TTC ATG GGC CAC CGC TTC GAC CTG 350 

Glu Arg lie val Gly His Ala Tyr Phe Met Gly His Arg Phe Asp Leu 
75 80 85 

GCC CCC GGC GTC TTC GTC CCC AAA CCC GAG ACC GAG GAG ATC ACC CGG 398 
Ala Pro Gly Val Phe Val Pro Lys Pro Glu Thr Glu Glu lie Thr Arg 
90 95 100 105 
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GAC GCC ATC GCC CGC CTG GAG GCC CTC GTC CGC CGC GGC ACC ACC GCA 446 
Asp Ala lie Ala Arg Leu Glu Ala Leu Val Arg Arg Gly Thr Thr Ala 
110 . 115 120 

CCC CTG GTC GTC GAC CTG TGC GCC GGA CCG GGC ACC ATG GCC GTC ACC 494 

Pro Leu val val Asp Leu Cys Ala Gly Pro Gly Thr Met Ala val Thr 
125 130 135 

CTG GCC CGC CAC GTA CCG GCC GCC CGC GTC CTG GGC ATC GAA CTC TCC 542 
Leu Ala Arg His val Pro Ala Ala Arg val Leu Gly lie Glu Leu ser 
140 145 150 

CAG GCC GCC GCC CGC GCC GCC CGG CGC AAC GCC CGC GGC ACC GGC GCC 590 
Gin Ala Ala Ala Arg Ala Ala Arg Arg Asn Ala Arg Gly Thr Gly Ala 
155 160 165 

CGC ATC GTG CAG GGC GAC GCC CGC GAC GCC TTC CCC GAA CTG AGC GGC 638 

Arg lie val Gin Gly Asp Ala Arg Asp Ala Phe Pro Glu Leu Ser Gly 
170 175 180 185 

ACC GTC GAC CTC GTC GTC ACC AAC CCG CCC TAC ATC CCC ATC GGA CTG 686 

Thr Val Asp Leu Val val Thr Asn Pro Pro Tyr lie Pro lie Gly Leu 
190 195 200 

CGC ACC TCC GCA CCC GAA GTG CTC GAG CAC GAC CCG CCG CTG GCC CTG 734 

Arg Thr ser Ala Pro Glu val Leu Glu His Asp Pro Pro Leu Ala Leu 
205 210 215 

TGG GCC GGG GAG GAG GGC CTC GGC ATG ATC CGC GCC ATG GAA CGC ACC 782 
Trp Ala Gly Glu Glu Gly Leu Gly Met lie Arg Ala Met Glu Arg Thr 

220 225 230 

GCG GCC CGG CTG CTG GCC CCC GGC GGC GTC CTG CTC CTC GAA CAC GGC 830 

Ala Ala Arg Leu Leu Ala Pro Gly Gly val Leu Leu Leu Glu His Gly 
235 240 245 

TCC TAC CAA CTC GCC TCC GTG CCC GCC CTG TTC CGC GCA ACC GGC CGC 878 
ser Tyr Gin Leu Ala Ser Val Pro Ala Leu Phe Arg Ala Thr Gly Arg 
250 255 260 265 

TGG AGC CAC GCC TCG TCC CGT CCC ACC TGC AAC GAC GGC TGC CTG ACC 926 

Trp Ser His Ala Ser Ser Arg Pro Thr Cys Asn Asp Gly Cys Leu Thr 
270 275 280 

GCC GTA CGC AAC CAC ACC TGC GCA CCG CCC GCC TGACACGGCG TCACGGCACG 979 

Ala val Arg Asn His Thr Cys Ala Pro Pro Ala 
285 290 

GCCGGCCTGT CGGCAACGAC CCTACGCCAT TGACAAACCG ACCGTGCCGT I I I I I lAATG 1039 

TCGGGGTGGC GGA 1052 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: CDNA 

(iii) hypothetical: no 

(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S.pristinaespi ralis 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3. .227 

(D) OTHER INFORMATION: /product= " Parti 6 du gene Snbc" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AG ATC TTC GAG CAC AAG ACC GTC GCC CAG CTC GCA CCC GTC GCC GAG 47 
lie Phe Glu His Lys Thr val Ala Gin Leu Ala Pro val Ala Glu 
15 10 15 

ACG CTC GCC GAC ACC ACC CGC GAG GAA CCC GCC GCC GTC GCC GCG ACC 95 

Thr Leu Ala Asp Thr Thr Arg Glu Glu Pro Ala Ala val Ala Ala Thr 
20 25 30 

GGC GAC GTA CCG CTC ACC CCG ATC ATG CAC TGG CTG CGC GAA CGC GGC 143 

Gly Asp val Pro Leu Thr Pro lie Met His Trp Leu Arg Glu Arg Gly 
35 40 45 

GGC CCC GTC GAC GCG TTC AGC CAG ACG ATG GCC GTC ACC GTC CCC GCC 191 

Gly Pro val Asp Ala Phe Ser Gin Thr Met Ala Val Thr val Pro Ala 
50 55 60 

GGC CTG GAC CGG GAA CGG CTC GTG GCC GCC CTG CAG 227 

Gly Leu Asp Arg Glu Arg Leu val Ala Ala Leu Gin 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S . pri sti naespi ral i s 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .247 

(D) OTHER INFORMATION: /product= " Parti e du gene SnbC" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
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CTC GAG TAC GAC ACC GCC CTG TAC GAG CGG GCC ACC GCC GAA GCC CTC 48 

Leu Glu Tyr Asp Thr Ala Leu Tyr Glu Arg Ala Thr Ala Glu Ala Leu 
15 10 15 

ACC GGC CGG CTG CTG CGG CTC CTC GAC GCC GTC GTC ACC GAC CCG CAG 96 

Thr Gly Arg Leu Leu Arg Leu Leu Asp Ala val val Thr Asp Pro Gin 
20 25 30 

GCG CCG GTC GGC TCC CAC GAC CTC CTC GAA GAG GCC GAA CAC GCC CGC 144 
Ala Pro val Gly Ser His Asp Leu Leu Glu Glu Ala Glu His Ala Arg 
35 40 45 

CTG GCA GCC TTC AAC GAC ACC GCC CGG CCC GTG CCG CGA GCC GGC CTC 192 

Leu Ala Ala Phe Asn Asp Thr Ala Arg Pro val Pro Arg Ala Gly Leu 
50 55 60 

GCC GAA CTC TTC ACC GCC CAG GCC CGC CGC ACC GCC GAT GCG GTC GCC 240 
Ala Glu Leu Phe Thr Ala Gin Ala Arg Arg Thr Ala Asp Ala val Ala 
65 70 75 80 

GTC GTC G 247 
Val val 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: liTtear 

(ii) MOLECULE TYPE: CDNA 

(iii) hypothetical: no 

(iv) anti-sense: no 

(vi) original source: 

(A) ORGANISM: s . pri sti naespi ral i s 

(ix) feature: 

(A) NAME/KEY: CDS 

(B) LOCATION : 3 . - 192 

(D) OTHER INFORMATION: /product= "Parti e du gene SnbD" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GC ATG CCC CCC GTC ACC CCC TAC CGC GCC TAC CTG GCC CAC CTC GCC 47 

Met pro Pro val Thr Pro Tyr Arg Ala Tyr Leu Ala His Leu Ala 
15 10 15 

GGC CGT GAC GAC GAC GCC GCC CGC GCC GCG TGG CGG ACC GCC CTC GCG 95 

Gly Arg Asp Asp Asp Ala Ala Arg Ala Ala Trp Arg Thr Ala Leu Ala 
20 25 30 

GAC CTG GAG GAG CCG AGC CTC GTC GCG GGC GCC GGA GCA GGC CGC GGC 143 

Asp Leu Glu Glu Pro Ser Leu val Ala Gly Ala Gly Ala Gly Arg Gly 
35 40 45 
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GCC GCC GAC GGC TCC GCC CTG CCC GGC CAG ATC CCC GGT TAG CGA GCT C 192 
Ala Ala Asp Gly Ser Ala Leu Pro Gly Gin lie Pro Gly Tyr Arg Ala 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S . pri sti naespi ral i s 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .474 

(D) OTHER INFORMATION: /product= "Parti e du gene SnbD" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CTG CAG GTC GAG GGC CGG CCC GCG CAC CTG GAA CTG CCC TGC GAC CAC 48 
Leu Gin val Glu Gly Arg Pro Ala His Leu Glu Leu Pro Cys Asp His 
15 10 15 

CCC CGG CCC GCC GTC GCC ACC CAC CGC GGC GCC ACC GTG CCC TTC CAC 96 
Pro Arg Pro Ala Val Ala Thr His Arg Gly Ala Thr Val Pro Phe His 
20 25 30 

ATC GAC GCC GGC CTC CAC GAG AAG CTG ACC GCG CTC TCC AAG GCC TGC 144 
lie Asp Ala Gly Leu His Glu Lys Leu Thr Ala Leu Ser Lys Ala Cys 
35 40 45 

GAC AGC AGC CTG TTC ATG GTG CTC CAG GCC GCG GTC GCC GCC CTG CTC 192 

Asp ser ser Leu Phe Met val Leu Gin Ala Ala val Ala Ala Leu Leu 

50 55 60 

ACC CGG CAC GGC GCC GGC ACC GAC ATC CCC GTC GGC AGC CCC GTC GCC 240 

Thr Arg His Gly Ala Gly Thr Asp lie Pro Val Gly Ser Pro val Ala 
65 70 75 80 

GGC CGC ACC GAC GAC GCC CTC GAC GAC CTG GTG GGC TTC TTC GTC AAC 288 
Gly Arg Thr Asp Asp Ala Leu Asp Asp Leu val Gly Phe Phe val Asn 
85 90 95 

ACC CTC GTC CTG CGC ACC GAC ACC TCC GGC GAC CCC ACC TTC CGC GAA 336 
Thr Leu val Leu Arg Thr Asp Thr Ser Gly Asp Pro Thr Phe Arg Glu 
100 105 110 

CTC GTC GCA CGC GTG CGG CAG TTC GAC CTC GCC GCC TAC ACG CAC CAG 384 
Leu Val Ala Arg val Arg Gin Phe Asp Leu Ala Ala Tyr Thr His Gin 
115 120 125 

GAC ATG CCG TTC GAA AAG CTC GTC GAA GAG GTC AAC CCC GAG CGC TCC 432 
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Asp Met Pro Phe Glu Lys Leu val Glu Glu val Asn Pro Glu Arg ser 
130 135 140 

CTG GCC CGC AAC CCG CTC TTC CAG GTC GTC CTG GCG CTG CAG 

Leu Ala Arg Asn Pro Leu Phe Gin val val Leu Ala Leu Gin 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) topology: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 

(iv) anti-sense: no 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: s . pri sti naespi ral i s 

(ix) FEATURE: 

(A) name/key: CDS 

(B) LOCATION: 3.. 485 

(D) OTHER INFORMATION: /product= " Parti e du gene SnbE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GC ATG CCG CGC TCC CTC GAC CTG TAC GTC GCA CTG CTC GCC GTC CTC 

Met Pro Arg Ser Leu Asp Leu Tyr Val Ala Leu Leu Ala Val Leu 
15 10 15 

AAG ACC GGC GCC GCC TAC CTG CCC GTC GAC ATC TCC TAC CCG GCC GAA 
Lys Thr Gly Ala Ala Tyr Leu Pro Val Asp lie Ser Tyr Pro Ala Glu 
20 25 30 

CGC ATC GCG TTC ATG ATC GAG GAC GCC CGC CCG GTG ACC GTC CTC GAC 

Arg lie Ala Phe Met lie Glu Asp Ala Arg Pro val Thr Val Leu Asp 
35 40 45 

CGC CTG CCC GAC GAC CTG GGC GCC TAC CGG GAC ACC GAC CTC ACC GAC 

Arg Leu Pro Asp Asp Leu Gly Ala Tyr Arg Asp Thr Asp Leu Thr Asp 
50 55 60 

GCC GAC CGC ACG GCG CCG CTA CGG CCC GAA CAC CCG GCG TAC GTC ATC 

Ala Asp Arg Thr Ala Pro Leu Arg Pro Glu His Pro Ala Tyr val lie 
65 70 75 

CAC ACC TCC GGC TCC ACC GGC ACC CCC AAG GCC GTC GTC ATG CCC CAC 

His Thr Ser Gly Ser Thr Gly Thr Pro Lys Ala val Val Met Pro His 
80 85 90 95 

GCC GGC CTG GTC AAC CTG CTG ACC TGG CAC GCC CGC CGC TTC CCC GGC 
Ala Gly Leu val Asn Leu Leu Thr Trp His Ala Arg Arg Phe Pro Gly 
100 105 110 

GGC ACC GGG GTG CGC ACC GCC CAG TTC ACC GCC ATC GGC TTC GAC TTC 
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Gly Thr Gly Val Arg Thr Ala Gin Phe Thr Ala He Gly Phe Asp Phe 
115 120 125 

TCG GTG CAG GAG ATC CTC TCC CCG CTC GTC ATG GGC AAG ACC CTC GCC 431 

Ser val Gin Glu lie Leu Ser Pro Leu val Met Gly Lys Thr Leu Ala 
130 135 140 

GTG CCC TCG GAA GAG GTC CGC CAC AGC GCC GAA CTG CTG GCC GGC TGG 479 
Val Pro ser Glu Glu val Arg His Ser Ala Glu Leu Leu Ala Gly Trp 
145 150 155 

CTC GAG 485 

Leu Glu 

160 



(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) topology: linear 
(ii) MOLECULE type: cdna 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S . pri sti naespi ral i s 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..291 

(D) OTHER INFORMATION: /product= "Parti e du gene SnbE" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CTG CAG GCC GAG GGC GCC GAA GTG AGC CTG CTG GCC GTC CTC GAC GGC 48 

Leu Gin Ala Glu Gly Ala Glu val Ser Leu Leu Ala val Leu Asp Gly 
15 10 15 

TAC CCC GAC GCC TAC GAC GGC ACC GAG CAC GAG GTC GGC GAG GAA CAG 96 
Tyr Pro Asp Ala Tyr Asp Gly Thr Glu His Glu val Gly Glu Glu Gin 
20 25 30 

GTC CTG GCG ATC CTC CTC AAC GCC GCC GGC GTC GAC CGG GCC CAG GCC 144 

Val Leu Ala He Leu Leu Asn Ala Ala Gly Val Asp Arg Ala Gin Ala 
35 40 45 

TTC GGC GAC GCC CCC CTC CAA CGG GCC GCC GTG CTC GAG AAG CTG CGC 192 
Phe Gly Asp Ala Pro Leu Gin Arg Ala Ala Val Leu Glu Lys Leu Arg 
50 55 60 

GAC AGC GGC AGC GCC CTG GGC AAC CTC GAC GAC GAC GCG GTC GGC CGC 240 
Asp Ser Gly Ser Ala Leu Gly Asn Leu Asp Asp Asp Ala val Gly Arg 
65 70 75 80 

ATG GTC ACC GTC TTC CTC AAC AAC ACG CGC CTC ATC CAG AAC TTC CGG 288 
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Met val Thr val Phe Leu Asn Asn Thr Arg Leu lie Gin Asn Phe Arq 
85 90 95 

CCC 291 
Pro 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Thr Ala Pro Arg Arg Arg lie Thr Leu Ala Gly lie lie Asp Gly 
15 10 15 

Pro Gly Gly His Val Ala Ala Trp Arg His Pro Ala Thr Lys Ala Asp 
20 25 30 

Ala Gin Leu Asp Phe Glu Phe His Arg Asp Asn Ala Arg Thr Leu Glu 
35 40 45 

Arg Gly Leu Phe Asp Ala Val Phe lie Ala Asp lie val Ala Val Trp 
50 55 60 

Gly Thr Arg Leu Asp ser Leu Cys Arg Thr Ser Arg Thr Glu His Phe 
65 70 75 80 

Glu Pro Leu Thr Leu Leu Ala Ala Tyr Ala Ala val Thr Glu His lie 
85 90 95 

Gly Leu Cys Ala Thr Ala Thr Thr Thr Tyr Asn Glu Pro Ala His lie 
100 105 110 

Ala Ala Arg Phe Ala Ser Leu Asp His Leu ser Gly Gly Arg Ala Gly 
115 120 125 

Trp Asn val val Thr Ser Ala Ala Pro Trp Glu ser Ala Asn Phe Gly 
130 135 140 

Phe Pro Glu His Leu Glu His Gly Lys Arg Tyr Glu Arg Ala Glu Glu 
145 150 155 160 

Phe lie Asp val val Lys Lys Leu Trp Asp Ser Asp Gly Arg Pro val 
165 170 175 

Asp His Arg Gly Thr His Phe Glu Ala Pro Gly Pro Leu Gly lie Ala 
180 185 190 

Arg Pro Pro Gin Gly Arg Pro val lie lie Gin Ala Gly Ser Ser Pro 
195 200 205 

val Gly Arg Glu Phe Ala Ala Arg His Ala Glu val lie Phe Thr Arg 
210 215 220 

His Asn Arg Leu Ser Asp Ala Gin Asp Phe Tyr Gly Asp Leu Lys Ala 
225 230 235 240 

Arg Val Ala Arg His Gly Arg Asp Pro Glu Lys val Leu val Trp Pro 
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245 250 255 

Thr Leu Ala Pro lie val Ala Ala Thr Asp Thr Glu Ala Lys Gin Arg 
260 265 270 

Leu Gin Glu Leu Gin Asp Leu Thr His Asp His val Ala Leu Arg Thr 
275 280 285 

Leu Gin Asp His Leu Gly Asp val Asp Leu Ser Ala Tyr Pro lie Asp 
290 295 300 

Gly Pro val Pro Asp lie Pro Tyr Thr Asn Gin Ser'Gln Ser Thr Thr 
305 310 315 320 

Glu Arg Leu lie Gly Leu Ala Arg Arg Glu Asn Leu Ser lie Arg Glu 

325 330 335 

Leu Ala Leu Arg Leu Met Gly Asp lie Val Val Gly Thr Pro Glu Gin 
340 345 350 

Leu Ala Asp His Met Glu Ser Trp Phe Thr Gly Arg Gly Ala Asp Gly 
355 360 365 

Phe Asn lie Asp Phe Pro Tyr Leu Pro Gly Ser Ala Asp Asp Phe val 

370 375 380 

Asp His val val Pro Glu Leu Gin Arg Arg Gly Leu Tyr Arg Ser Gly 
385 390 395 400 

Tyr Glu Gly Thr Thr Leu Arg Ala Asn Leu Gly lie Asp Ala Pro Arg 
405 410 415 

Lys Ala Gly Ala Ala Ala 
420 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Thr Ala Pro lie Leu val Ala Thr Leu Asp Thr Arg Gly Pro Ala 
15 10 15 

Ala Thr Leu Gly Thr lie Thr Arg Ala val Arg Ala Ala Glu Ala Ala 
20 25 30 

Gly Phe Asp Ala Val Leu lie Asp Asp Arg Ala Ala Ala Gly val Gin 
35 40 45 

Gly Arg Phe Glu Thr Thr Thr Leu Thr Ala Ala Leu Ala Ala Val Thr 
50 55 60 

Glu His He Gly Leu lie Thr Ala Pro Leu Pro Ala Asp Gin Ala Pro 
65 70 ^ 75 80 

Tyr His Val Ser Arg lie Thr Ala Ser Leu Asp His Leu Ala His Gly 
85 90 95 
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Arg Thr Gly Trp Leu Ala Ser Thr Asp Thr Thr Asp Pro Glu Gly Arg 
100 105 110 

Thr Gly Glu Leu lie Asp val val Arg Gly Leu Trp Asp Ser Phe Asp 
115 120 125 

Asp Asp Ala Phe val His Asp Arg Ala Asp Gly Leu Tyr Trp Arg Leu 
130 135 140 

Pro Ala val His Gin Leu Asp His Gin Gly Arg His Phe Asp val Ala 
145 150 155 160 

Gly Pro Leu Asn val Ala Arg Pro Pro Gin Gly His Pro Val Val Ala 
165 170 175 

val Thr Gly Pro Ala Leu Ala Ala Ala Ala Asp Leu val Leu Leu Asp 
180 185 190 

Glu Ala Ala Asp Ala Ala Ser val Lys Gin Gin Ala Pro His Ala Lys 

195 200 205 

lie Leu Leu Pro Leu Pro Gly Pro Ala Ala Glu Leu Pro Ala Asp Ser 
210 215 220 

Pro Ala Asp Gly Phe Thr val Ala Leu Thr Gly ser Asp Asp Pro val 
225 230 235 240 

Leu Ala Ala Leu Ala Ala Arg Pro Gly Arg Pro Asp Arg Thr Ala Ala 

245 250 255 

Thr Thr Leu Arg Glu Arg Leu Gly Leu Ala Arg Pro Glu Ser Arg His 
260 265 270 

Ala Leu Thr Thr Ala 
275 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met ser Arg Arg Leu Phe Thr ser Glu ser val Thr Glu Gly His Pro 
15 10 15 

Asp Lys lie Ala Asp Gin lie Ser Asp Thr val Leu Asp Ala Leu Leu 
20 25 30 

Arg Glu Asp Pro Ala Ser Arg val Ala val Glu Thr Leu lie Thr Thr 
35 40 45 

Gly Gin val His lie Ala Gly Glu val Thr Thr Lys Ala Tyr Ala Pro 
50 55 60 

He Ala Gin Leu Val Arg Asp Thr lie Leu Ala lie Gly Tyr Asp Ser 
65 70 75 80 
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ser Ala Lys Gly Phe Asp Gly Ala Ser cys Gly val Ser val Ser lie 
85 90 95 

Gly Ala Gin Ser Pro Asp He Ala Gin Gly val Asp Ser Ala Tyr Glu 
100 105 110 

Thr Arg val Glu Gly Glu Asp Asp Glu Leu Asp Gin Gin Gly Ala Gly 
115 120 125 

Asp Gin Gly Leu Met Phe Gly Tyr Ala Thr Asp Glu Thr Pro ser Leu 
130 135 140 

Met Pro Leu Pro lie Glu Leu Ala His Arg Leu ser Arg Arg Leu Thr 

145 150 155 160 

Glu Val Arg Lys Asp Gly Thr Val Pro Tyr Leu Arg Pro Asp Gly Lys 
165 170 175 

Thr Gin val Thr lie Glu Tyr Gin Gly Ser Arg Pro val Arg Leu Asp 
180 185 190 

Thr val val val Ser ser Gin His Ala Ala Asp lie Asp Leu Gly Ser 

195 200 205 

Leu Leu Thr Pro Asp lie Arg Glu His val val Glu His val Leu Ala 
210 215 220 

Ala Leu Ala Glu Asp Gly lie Lys Leu Glu Thr Asp Asn Tyr Arg Leu 
225 230 235 240 

Leu Val Asn Pro Thr Gly Arg Phe Glu lie Gly Gly Pro Met Gly Asp 

245 250 255 

Ala Gly Leu Thr Gly Arg Lys lie lie lie Asp Thr Tyr Gly Gly Met 
260 265 270 

Ala Arg His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val 
275 280 285 

Asp Arg Ser Ala Ala Tyr Ala Met Arg Trp val Ala Lys Asn val val 

290 295 300 

Ala Ala Gly Leu Ala Ser Arg Cys Glu Val Gin val Ala Tyr Ala lie 
305 310 315 320 

Gly Lys Ala Glu Pro val Gly Leu Phe val Glu Thr Phe Gly Thr Gly 
325 330 335 

Thr val Ala Gin Glu Arg lie Glu Lys Ala lie Thr Glu val Phe Asp 
340 345 350 

Leu Arg Pro Ala Ala lie lie Arg Asp Leu Asp Leu Leu Arg Pro lie 
355 360 365 

Tyr Ala Ala Thr Ala Ala Tyr Gly His Phe Gly Arg Glu Leu Pro Asp 
370 375 380 

Phe Thr Trp Glu Arg Thr Asp Arg Ala His Arg Leu Lys Ala Ala Ala 
385 390 395 400 

Gly Leu 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) topology: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



Met Leu 
1 

Asp Gly Cys val Pro Trp Pro Glu Asp val Ala Ala Lys Tyr Arg Ala 
5 10 15 

Ala Gly Tyr Trp Arg Gly Glu Pro Leu Gly Met Leu Leu Gly Arg Trp 
20 25 30 

Ala Glu Gin Tyr Gly Glu Arg Glu Ala Leu val Gly Ala Asp Gly Cys 
35 40 45 50 

Ser Arg val Thr Tyr Arg Ala Leu Asp Arg Trp Cys Asp Arg Leu Ala 
55 60 65 

Ala Gly Phe Ala Ala Arg Gly lie Gly Ala Gly Glu Arg val Leu val 
70 75 80 

Gin Leu Pro Asn Thr Pro Glu Phe val Ala Val Cys Phe Ala Leu Phe 
85 90 95 

Arg Leu Gly Ala Leu Pro val Phe Ala Leu Pro Ala His Arg Ala Ala 
100 105 110 

Glu val Gly His Leu Leu Glu Leu Ser Gly Ala Val Ala His lie Leu 
115 120 125 130 

Pro Gly Thr Gly Thr Gly Tyr Asp His Val Ala Ala Ala Val Glu Ala 
135 140 145 

Arg Ala Arg Arg Ala Arg Pro Val Gin Val Phe Val Ala Gly Glu Ala 
150 155 160 

Pro Ala val Leu Pro Glu Gly Phe Thr Ala Leu Ala Asp Val Asp Gly 
165 170 175 

Asp Pro val Ala Pro Ala Asp val Asp Ala Phe Arg Arg Gly val Phe 
180 185 190 

pt 

Leu Leu Ser Gly Gly Thr Thr Ala Leu Pro Lys Leu lie Pro Arg Thr 
195 200 205 210 

His Asp Asp Tyr Ala Tyr Gin Cys Arg val Thr Ala Gly lie Cys Gly 
215 220 225 

Leu Asp Ala Asp Ser Val Tyr Leu Ala val Leu Pro Ala Glu Phe Asn 
230 235 240 

Phe Pro Phe Gly Cys Pro Gly lie Leu Gly Thr Leu His Ala Gly Gly 
245 250 255 
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Arg val val Phe Ala Leu Sen Pro Gin Pro Glu Glu cys Phe Ala Leu 
260 265 270 

lie Glu Arg Glu His val Thr Phe Thr Ser val lie Pro Thr lie val 
275 280 285 290 

His Leu Trp Leu Ala Ala Ala Ala Gin Gly His Gly Arg Asp Leu Gly 
295 300 305 

Ser Leu Gin Leu Leu Gin val Gly ser Ala Lys Leu. His Glu Glu Leu 
310 315 320 

Ala Ala Arg He Gly Pro Glu Leu Gly Val Arg Leu Gin Gin val Phe 

325 330 335 

Gly Met Ala Glu Gly Leu Leu Thr Phe Thr Arg Asp Asp Asp pro Ala 
340 345 350 

Asp Val val Leu Arg Thr Gin Gly Arg Pro val Ser Glu Ala Asp Glu 
355 360 365 370 

lie Arg val Ala Asp Pro Asp Gly Arg Pro val Pro Arg Gly Glu Thr 

375 380 385 

Gly Glu Leu Leu Thr Arg Gly Pro Tyr Thr Leu Arg Gly Tyr Tyr Arg 
390 395 400 

Ala Pro Glu His Asn Ala Arg Ala Phe Thr Glu Asp Gly Phe Tyr Arg 
405 410 415 

Ser Gly Asp Leu val Arg Leu Thr Ala Asp Gly Gin Leu val val Glu 

420 425 430 

Gly Arg lie Lys Asp Val val He Arg Gly Gly Asp Lys val Ser Ala 
435 440 445 450 

Thr Glu val Glu Gly His Leu Gly Ala His Pro Asp val Gin Gin Ala 
455 460 465 

Ala val val Ala Met Pro Asp Pro val Trp Gly Glu Lys val cys Ala 
470 475 480 

Tyr lie Val Pro Ala Pro Gly Arg Pro Ala Pro Pro Met Ala Ala Leu 
485 490 495 

Arg Arg Leu Leu Arg Ala Arg Gly Leu Ala Asp Tyr Lys Leu Pro Asp 
500 505 510 

Arg Val Glu val val Asp Ala Phe Pro Leu Thr Gly Leu Asn Lys val 
515 520 525 530 

Asp Lys Lys Ala Leu Ala Ala Asp lie Ala Ala Lys Thr Ala Pro Thr 
535 540 545 

Arg Pro Thr Thr Ala Gly His Gly Pro Thr Thr Asp Gly Asp Thr Ala 

550 555 560 

Gly Gly Gly Gly Ser Ala Gly Gly val Thr Ala Ala Gly Gly Gly Arg 
565 570 575 

Glu Glu Ala Ala 
580 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) topology: linear 



(xi) SEQUENCE description: SEQ ID NO: 21: 



Met Arg Thr Ser 
1 



Arg Ser His Asp Gin Arg Ala Pro Thr Pro Trp Arg His Pro Leu His 
5 10 15 20 

Ser Thr Arg Pro Ala Pro Ala Ala Asp Arg Asp Pro Arg Arg Trp val 
25 30 35 

lie Leu Gly val lie Cys Leu Ala Gin Leu val val Leu Leu Asp Asn 
40 45 50 

Thr Val Leu Asn val Ala lie Pro val Leu Thr Thr Asp Leu Gly Ala 
55 60 65 

Ser Thr Ala Asp lie Gin Trp Met lie Asn Ala Tyr Ala Leu val Gin 
70 75 80 

Ser Gly Leu Leu Leu Thr Ala Gly ser Leu Ala Asp Arg Tyr Gly Arg 
85 90 95 100 

Lys Arg Leu Leu Met Leu Gly Leu val Leu Phe Gly Ala Gly ser Ala 

105 110 115 

Trp Ala Ala Phe Ala Gin Asp Ser Ala Gin Leu lie Ala Ala Arg Ala 
120 125 130 

Gly Met Gly val Gly Gly Ala Leu Leu Ala Thr Thr Thr Leu Ala val 
135 140 145 

lie Met Gin val Phe Asp Asp Asp Glu Arg Pro Arg Ala lie Gly Leu 

150 155 160 

Trp Gly Ala Ala Ser ser Leu Gly Phe Ala Ala Gly Pro Leu Leu Gly 
165 170 175 180 

Gly Ala Leu Leu Asp His Phe Trp Trp Gly Ser lie Phe Leu lie Asn 
185 190 195 

Leu Pro Val Ala Leu Leu Gly Leu Leu Ala val Ala Arg Leu val Pro 
200 205 210 

Glu Thr Lys Asn Pro Glu Gly Arg Arg Pro Asp Leu Leu Gly Ala val 
215 220 225 

Leu Ser Thr Leu Gly Met val Gly Val Val Tyr Ala lie lie Ser Gly 
230 235 240 

Pro Glu His Gly Trp Thr Ala Pro Gin val Leu Leu Pro Ala Ala val 
245 250 255 260 

Ala Ala Ala Ala Leu Thr Ala Phe val Arg Trp Glu Leu His Thr Pro 
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265 270 275 

His Pro Met Leu Asp Met Gly Phe Phe Thr Asp Arg Arg Phe Asn Gly 
280 285 290 

Pro Ser Pro Ala Glu Cys Ser ser Phe Gly Met Ala Gly Ser Leu Phe 

295 300 305 

Leu Leu Thr Gin His Leu Gin Leu Val Leu Gly Tyr Asp Ala Leu Gin 
310 315 320 

Ala Gly Leu Arg Thr Ala Pro Leu Ala Leu Thr lie val Ala Leu Asn 
325 330 335 340 

Leu Ala Gly Leu Gly Ala Lys Leu Leu Ala Ala Leu Gly Thr Ala Arg 

345 350 355 

Ser lie Ala Leu Gly Met Thr Leu Leu Ala Ala Gly Leu ser Ala val 
360 365 370 

Ala Val Gly Gly Ser Gly Pro Asp Ala Gly Tyr Gly Gly Met Leu Ala 
375 380 385 

Gly Leu Leu Leu Met Gly Ala Gly lie Ala Leu Ala Met Pro Ala Met 
390 395 400 

Ala Thr Ala val Met Ser Ser lie Pro Pro Ala Lys Ala Gly Ala Gly 
405 410 415 420 

Ala Gly val Gin Gly Thr Leu Thr Glu Phe Gly Gly Gly Leu Gly val 
425 430 435 

Ala lie Leu Gly Ala Val Leu Gly Ser Arg Phe Ala Ser Gin Leu Pro 
440 445 450 

Ala Ala lie Thr Gly Thr Gly ser Leu Asp Glu Ala Leu Arg Asp Ala 
455 460 465 

Thr Pro Gin Gin Ala Gly Gin val His Asp Ala Phe Ala Asp Ala Val 
470 475 480 

Asn Thr Ser Gin Leu lie Gly Ala Ala Ala val Phe Thr Gly Gly Leu 
485 490 495 500 

Leu Ala Ala Leu Leu Leu His Arg Ala Asp Arg Lys Ala Ala Pro Gin 
505 510 515 

Pro Thr Ala Pro Thr Pro Glu Pro Thr Thr Thr Ala 
520 525 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS : single 

(D) topology: linear 

Cxi) SEQUENCE description: SEQ ID NO: 22: 

Val Thr Gly Ala Asp Asp Pro 
1 5 
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Ala Arg Pro Ala val Gly Pro Gin Ser Phe Arg Asp Ala Met Ala Gin 
10 15 20 

Leu Ala ser Pro val Thr val val Thr val Leu Asp Ala Ala Gly Arg 

25 30 35 

Arg His Gly Phe Thr Ala Gly Ser val Val Ser val Ser Leu Asp Pro 
40 45 50 55 

Pro Leu val Met val Gly lie Ala Leu Thr ser ser Cys His Thr Ala 
60 65 70 

Met Ala Ala Ala Ala Glu Phe cys val ser lie Leu Gly Glu Asp Gin 
75 80 85 

Arg Ala val Ala Lys Arg Cys Ala Thr His Gly Ala Asp Arg Phe Ala 
90 95 100 

Gly Gly Glu Phe Ala Ala Trp Asp Gly Thr Gly Val Pro Tyr Leu Pro 
105 110 115 

Asp Ala Lys Val Val Leu Arg Cys Arg Thr Thr Asp val val Arg Ala 
120 125 130 135 

Gly Asp His Asp Leu val Leu Gly Thr Pro val Glu lie Arg Thr Gly 
140 145 150 

Asp Pro Ala Lys Pro Pro Leu Leu Trp Tyr 
155 160 

(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Ala Thr Ala Arg Leu lie Gly Pro Leu Pro Arg Arg Leu Gly Leu Gin 
15 10 15 

Val His Gin val Met Thr Gly Ala Phe Ala Gin Ala Leu Ala Arg Trp 
20 25 30 

Arg Gly Ser Arg Ala val Thr Phe Asp Val Glu Thr His Gly Arg His 
35 40 45 

Gly Arg Asp Glu Leu Phe Arg Thr Val Gly Trp Phe Thr Ser lie His 
50 55 60 

Pro val val Leu Gly Ala Asp Arg Ser val His Pro Glu Gin Tyr Leu 
65 70 75 80 

Ala Gin lie Gly Ala Ala Leu Thr Ala Ala Pro Asp Gly Gly val Gly 
85 90 95 

Phe Gly Ala Cys Arg Glu Phe Ser Pro Asp Ala Gly Leu Arg Thr Leu 
100 105 110 

Leu Arg Asp Leu Pro Pro Ala Leu val Cys Phe Asn Tyr Tyr Gly Gin 
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115 120 125 

Ala Asp Gin Leu Ser Pro Asn Gly Gly Phe Arg Met Ser Gly Arg Pro 
130 135 140 

lie Pro Arg Glu His Ser Ala Arg Cys Glu Arg val Tyr Gly lie Glu 
145 150 155 160 

Val Tyr Gly lie val His Gly Gly Arg Leu Arg Met Gly Leu Thr Trp 
165 170 175 

val Pro Ser Pro Ala Asp Gly val Asp Glu Ala Gly val Asp Ala Leu 
180 185 190 

val Glu Gin Met Ser Trp val Leu Ala Thr Leu Ala Gly Ala Asp Pro 
195 200 205 

His Ala val Thr Pro 
210 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) topology: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Val Arg Thr val Arg Thr Leu Leu lie Asp Asn Tyr Asp Ser Phe Thr 
1 5 10 15 

Tyr Asn Leu Phe Gin Met Leu Ala Glu val Asn Gly Ala Ala Pro Leu 
20 25 30 

val val Arg Asn Asp Asp Thr Arg Thr Trp Gin Ala Leu Ala Pro Gly 
35 40 45 

Asp Phe Asp Asn Val Val Val ser Pro Gly Pro Gly His Pro Ala Thr 
50 55 60 

Asp Thr Asp Leu Gly Leu Ser Arg Arg val lie Thr Glu Trp Asp Leu 
65 70 75 80 

Pro Leu Leu Gly val Cys Leu Gly His Gin Ala Leu Cys Leu Leu Ala 
85 90 95 

Gly Ala Ala val val His Ala Pro Glu Pro Phe His Gly Arg Thr ser 
100 105 110 

Asp lie Arg His Asp Gly Gin Gly Leu Phe Ala Asn lie Pro Ser Pro 
115 120 125 

Leu Thr Val Val Arg Tyr His Ser Leu Thr Val Arg Gin Leu Pro Ala 
130 135 140 

Asp Leu Arg Ala Thr Ala His Thr Ala Asp Gly Gin Leu Met Ala val 
145 150 155 160 

Ala His Arg His Leu Pro Arg Phe Gly val Gin Phe His Pro Glu Ser 
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165 170 175 

lie Ser Ser Glu His Gly His Arg Met Leu Ala Asn Phe Arg Asp Leu 

90 



180 185 19C 



Ser Leu Arg 
195 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

val Thr Ala Ala Ala Pro Thr Leu Ala 
1 5 

Gin Ala Leu Asp Glu Ala Thr Gly Gin Leu Thr Gly Ala Gly lie Thr 
10 15 20 25 

Ala Asp Ala Ala Arg Ala Asp Thr Arg Leu Leu Ala Ala His Ala Cys 
30 35 40 

Gin Val Ala Pro Gly Asp Leu Asp Thr Cys Leu Ala Gly Pro Val Pro 
45 50 55 

Pro Arg Phe Trp His Tyr Val Arg Arg Arg Leu Thr Arg Glu Pro Ala 
60 65 70 

Glu Arg lie val Gly His Ala Tyr Phe Met Gly His Arg Phe Asp Leu 
75 80 85 

Ala Pro Gly Val Phe val Pro Lys Pro Glu Thr Glu Glu lie Thr Arg 
90 95 100 105 

Asp Ala lie Ala Arg Leu Glu Ala Leu val Arg Arg Gly Thr Thr Ala 
110 115 120 

Pro Leu val val Asp Leu Cys Ala Gly Pro Gly Thr Met Ala val Thr 
125 130 135 

Leu Ala Arg His val Pro Ala Ala Arg val Leu Gly lie Glu Leu Ser 
140 145 150 

Gin Ala Ala Ala Arg Ala Ala Arg Arg Asn Ala Arg Gly Thr Gly Ala 
155 160 165 

Arg lie Val Gin Gly Asp Ala Arg Asp Ala Phe Pro Glu Leu Ser Gly 
170 175 180 185 

Thr Val Asp Leu Val val Thr Asn Pro Pro Tyr lie Pro lie Gly Leu 
190 195 200 

Arg Thr Ser Ala Pro Glu val Leu Glu His Asp Pro Pro Leu Ala Leu 

205 210 215 

Trp Ala Gly Glu Glu Gly Leu Gly Met lie Arg Ala Met Glu Arg Thr 
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220 225 230 

Ala Ala Arg Leu Leu Ala Pro Gly Gly Val Leu Leu Leu Glu His Gly 
235 240 245 

Ser Tyr Gin Leu Ala ser val Pro Ala Leu Phe Arg Ala Thr Gly Arg 
250 255 260 265 

Trp Ser His Ala Ser ser Arg Pro Thr Cys Asn Asp Gly Cys Leu Thr 
270 275 280 

Ala Val Arg Asn His Thr Cys Ala Pro Pro Ala 
285 290 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

lie Phe Glu His Lys Thr val Ala Gin Leu Ala Pro val Ala Glu 
15 10 15 

Thr Leu Ala Asp Thr Thr Arg Glu Glu Pro Ala Ala Val Ala Ala Thr 

20 25 30 

Gly Asp val Pro Leu Thr Pro lie Met His Trp Leu Arg Glu Arg Gly 
35 40 45 

Gly Pro val Asp Ala Phe Ser Gin Thr Met Ala val Thr val Pro Ala 
50 55 60 

Gly Leu Asp Arg Glu Arg Leu val Ala Ala Leu Gin 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(xi) SEQUENCE description: SEQ ID NO: 27: 

Leu Glu Tyr Asp Thr Ala Leu Tyr Glu Arg Ala Thr Ala Glu Ala Leu 
15 10 15 

Thr Gly Arg Leu Leu Arg Leu Leu Asp Ala Val Val Thr Asp Pro Gin 
20 25 30 

Ala Pro val Gly Ser His Asp Leu Leu Glu Glu Ala Glu His Ala Arg 
35 40 45 

Leu Ala Ala Phe Asn Asp Thr Ala Arg Pro val Pro Arg Ala Gly Leu 
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50 55 60 

Ala Glu Leu Phe Thr Ala Gin Ala Arg Arg Thr Ala Asp Ala val Ala 
65 70 75 80 

val val 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Pro Pro Val Thr Pro Tyr Arg Ala Tyr Leu Ala His Leu Ala 
15 10 15 

Gly Arg Asp Asp Asp Ala Ala Arg Ala Ala Trp Arg Thr Ala Leu Ala 

20 25 30 

Asp Leu Glu Glu Pro ser Leu val Ala Gly Ala Gly Ala Gly Arg Gly 
35 40 45 

Ala Ala Asp Gly Ser Ala Leu Pro Gly Gin lie Pro Gly Tyr Arg Ala 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(c) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Leu Gin val Glu Gly Arg Pro Ala His Leu Glu Leu Pro Cys Asp His 
1 5 ,10 15 

Pro Arg Pro Ala Val Ala Thr His Arg Gly Ala Thr Val Pro Phe His 
20 25 30 

lie Asp Ala Gly Leu His Glu Lys Leu Thr Ala Leu Ser Lys Ala Cys 
35 40 45 

Asp ser ser Leu Phe Met Val Leu Gin Ala Ala val Ala Ala Leu Leu 

50 55 60 

Thr Arg His Gly Ala Gly Thr Asp He Pro Val Gly Ser Pro val Ala 
65 70 75 80 

Gly Arg Thr Asp Asp Ala Leu Asp Asp Leu Val Gly Phe Phe Val Asn 
85 90 95 

Thr Leu Val Leu Arg Thr Asp Thr ser Gly Asp Pro Thr Phe Arg Glu 
100 105 110 
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Leu Val Ala Arg val Arg Gin Phe Asp Leu Ala Ala Tyr Thr His Gin 

115 120 125 

Asp Met Pro Phe Glu Lys Leu val Glu Glu val Asn Pro Glu Arg ser 

130 135 140 

Leu Ala Arg Asn Pro Leu Phe Gin val val Leu Ala Leu Gin 

145 150 155 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(c) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Pro Arg Ser Leu Asp Leu Tyr val Ala Leu Leu Ala Val Leu 
15 10 15 

Lys Thr Gly Ala Ala Tyr Leu Pro val Asp lie Ser Tyr Pro Ala Glu 
20 25 30 

Arg lie Ala Phe Met lie Glu Asp Ala Arg Pro val Thr val Leu Asp 
35 40 45 

Arg Leu Pro Asp Asp Leu Gly Ala Tyr Arg Asp Thr Asp Leu Thr Asp 
50 55 60 

Ala Asp Arg Thr Ala Pro Leu Arg Pro Glu His Pro Ala Tyr val lie 
65 70 75 

His Thr Ser Gly Ser Thr Gly Thr Pro Lys Ala val val Met Pro His 
80 85 90 95 

Ala Gly Leu val Asn Leu Leu Thr Trp His Ala Arg Arg Phe Pro Gly 
100 105 110 

Gly Thr Gly val Arg Thr Ala Gin Phe Thr Ala lie Gly Phe Asp Phe 
115 120 125 

Ser val Gin Glu lie Leu Ser Pro Leu val Met Gly Lys Thr Leu Ala 
130 135 140 

val Pro Ser Glu Glu val Arg His Ser Ala Glu Leu Leu Ala Gly Trp 
145 150 155 

Leu Glu 
160 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Leu Gin Ala Glu Gly Ala Glu Val Ser Leu Leu Ala val Leu Asp Gly 
15 10 15 

Tyr Pro Asp Ala Tyr Asp Gly Thr Glu His Glu val Gly Glu Glu Gin 
20 25 30 

val Leu Ala lie Leu Leu Asn Ala Ala Gly val Asp Arg Ala Gin Ala 
35 40 45 

Phe Gly Asp Ala Pro Leu Gin Arg Ala Ala val Leu Glu Lys Leu Arg 
50 55 60 

Asp Ser Gly Ser Ala Leu Gly Asn Leu Asp Asp Asp Ala Val Gly Ar 
65 70 75 8 

Met val Thr Val Phe Leu Asn Asn Thr Arg Leu lie Gin Asn Phe Arg 
85 90 95 

Pro 

(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
(c) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE description: SEQ ID NO: 32: 

ATC GAY TTY CCN TAY CTS CCS GG 23 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
(c) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TTC GAC GAY GAY GCN TTC GTS CAY GAC 27 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GTS CCS TGG CCS GAG GAC GTS GCS GCS AAG TAC 
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(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 
(c) STRANDEDNESS: Single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GAG GTS GAG GGS CAC CTS GGS GCS CAC CCS GAC GTS CAG CAG GC 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

CO STRANDEDNESS: single 
(D) topology: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Val Pro Ala Ala Phe Val Pro Leu Asp Ala Leu Pro Leu Thr Gly Asn 
15 10 15 

Gly val Leu Asp 
20 



(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
(c) STRANDEDNESS: single 
(D) topology: linear 

(xi) SEQUENCE description: SEQ ID NO: 37: 

GCS GCS TTC AAC GAC ACS GCS CGS CC 26 



(2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(xi) SEQUENCE description: SEQ ID NO: 38: 
TTC GTS CCS CTS GAC GCS CTS CCS CT 26 



(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: linear 

Cxi) SEQUENCE description: SEQ ID NO: 39: 
GTS ACS CCS TAC CGS GCS TAC 



(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
ACS CGB CTS ATC CAG AAC TTC CGB CC 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
TTC CGS GAC GCS ATG GCS CAG CTS GC 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TTC GCS GGS GGS GAG TTC GCS GCS TGG GAC GGC ACC GG 



(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GAC CCS GCS AAG CCS CCS CTS CTS TGG TAC CG 
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